FINGER CONDITIONING

There has been some discussion in the literature as to the
part voluntary and involuntary reactions may play in the 
establishment of conditioned responses. In a recent article 
Marquis and Porter (7) suggest that conditioned retraction of 
the finger to shock may actually be a case of voluntary con- 
ditioning, and, in support of this, state that the response of 
the finger to shock is inclined to differ in many respects from 
most reflexes. In the experiment whose discussion gave rise 
to this view, they compared conditioning of the eye blink 
when the stimulus producing the wink was a puff of air de- 
livered to the cornea, with the conditioning obtained when 
the subject voluntarily winked at a prearranged signal. In 
general, the differences they obtained between these two 
methods were considerable, and they concluded that volun- 
tary and involuntary conditioning widely differed from each 
other. 

The purpose of the present experiment has been to attempt 
to determine whether or not conditioning of the finger re- 
traction to shock may be considered as voluntary or invol- 
untary conditioning. This has been approached by roughly 
paralleling Marquis and Porter’s situation, namely, by com- 
paring the results for a group of subjects for whom the 
stimulus to flex the finger was a shock to that finger with the
results of a second group which flexed the finger voluntarily
when a light flashed on. If the statement of Marquis and 
Porter concerning the voluntary nature of finger conditioning 
is correct, we might expect to find no differences, or at least 
very slight differences, between these two groups, since both 
groups would be considered as cases of voluntary conditioning. 
On the other hand, if the differences between the two groups 
were very great, this could be taken as some evidence that 
in one situation we have involuntary conditioning and in the 
other situation we have voluntary conditioning. 


Apparatus.—In brief outline, the apparatus arrangement was as follows. There
was a conditioning board to which the subject’s arm was strapped. A system of levers 
connected to the subject’s middle finger made possible the recording of all movements 
of this finger upon a polygraph, on which was also recorded the incidence of the con- 
ditioned and unconditioned stimuli. A special timing device controlled the adminis- 
tration of these stimuli, a buzzer (the conditioned stimulus), and a shock or a light 
(the unconditioned stimulus). 

The conditioning board proper consisted of a plank about 30” by 9”. An adjust- 
able strap could be pushed against the subject’s elbow to prevent him from making 
horizontal withdrawal movements of the arm, while a strap over the upper forearm 
prevented him from lifting the arm from the board. His hand was restrained by 
means of a padded wooden strip about 9” by 14”” which was screwed down across 
the back of the hand midway between the wrist joint and the first knuckles. The 
middle finger—the one chosen to be conditioned—was enclosed in a small clamp, 
which was connected by means of a small spring coil about 3” long to a bar running 
parallel to the long axis of the board. The bar was 4” long, and, at its pivotable 
point, it was attached rigidly to a second bar, which was perpendicular to it, and, in 
general, to the surface of the conditioning board. Connected to the second bar in 
such a manner as to allow pivoting in the vertical plane was a third bar which ran 
across the surface of the polygraph. A small well pen on the third bar rested on the 
polygraph paper and recorded the finger movements of the subject. The entire lever 
system was light in structure and moderately free from friction so that it offered only 
slight resistance to movement. This type of recording system had several advantages, 
for, in the first place, it permitted a study of the form and amplitude of the finger 
movement, as well as the frequency and speed of its occurrence, and, in the second 
place, it magnified finger movements about twice the original size. 

The shock, which was the unconditioned stimulus for one of the groups, was 
delivered from a high voltage power pack assuring a fair degree of uniformity in the 
strength of the stimulus. One of the electrodes consisted of a piece of metal about 
3”’ by 13” which was covered with a lamp wick. Before the experiment, this was well 
saturated with salt and then strapped with adhesive tape to the volar surface of the 
arm near the thumb joint. The other electrode was the clamp to which the middle 
finger was fastened. The current was supplied to this clamp by means of a spring 
under the bar on which the clamp was attached. Spring and bar were in contact, 
thus permitting the current to flow up to the clamp; but when the finger was raised 
the contact was broken, and the shock could be avoided.

For the other two groups, the signal to which the subjects were required to lift
the finger consisted of the illumination of a six volt lamp. This lamp was about three 
feet directly in front of the subjects and on the level of the eyes. 

For all groups the conditioned stimulus was produced by a rather loud sounding 
buzzer, which was attached to the conditioning board to the right of the subject’s hand. 

The constant relationship between the time interval of the conditioned and the 
unconditioned stimulus was determined by a modified kymograph driven by a synchronous
motor.¹ The drum was electrified and covered with heavy paper through
which slots were cut at specified intervals. Hollerith brushes, coming in contact with
the drum through the slots, picked up the current and activated the buzzer, light, 
shock relay, and signal markers. By means of switches, the experimenter could 
determine what stimuli were to be given, and could also control the time intervals 
between stimulations. 

The paper on the polygraph, on which the response of the subject and the incidence 
of occurrence of the various stimuli were recorded, ran at a constant speed of two 
centimeters per second. This made it possible for units of 1/20th of a second to be
measured with considerable accuracy. 
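
As a worked check on this resolution, the short sketch below converts between elapsed time and distance along the polygraph record. Only the paper speed of two centimeters per second comes from the text; the function names are illustrative assumptions.

```python
# Paper speed stated in the text: two centimeters per second.
PAPER_SPEED_CM_PER_S = 2.0


def paper_distance_mm(seconds: float) -> float:
    """Distance the paper travels, in millimeters, during `seconds`."""
    return seconds * PAPER_SPEED_CM_PER_S * 10.0


def elapsed_seconds(distance_mm: float) -> float:
    """Time represented by a distance measured along the record."""
    return distance_mm / (PAPER_SPEED_CM_PER_S * 10.0)


if __name__ == "__main__":
    # One twentieth of a second corresponds to about one millimeter of paper.
    print(paper_distance_mm(1 / 20))
```

At this speed a twentieth of a second occupies roughly one millimeter of record, which is consistent with the claim that such units could be read off with some accuracy.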

All the apparatus described was placed on one table. The corner and side, on 
which the conditioning board was placed, was screened off with black cloth. Thus, 
the subject, seated at the board, was confined in a rather small space, and could see 
only the bare wall to the left and behind him, a black screen in front of him, and the 
conditioning board and screen to his right. 

Two experimenters were required to perform the experiment, and they sat in the 
same room as the subject. The room was not soundproof but most of the street 
noises, as well as the noises made by the experimenters, were screened by the regular 
sounds made by the polygraph and the timing drum. 

Procedure.—The experiment in general consisted of three different procedures, one 
involving involuntary conditioning, and two involving voluntary conditioning.² The
subjects of the involuntary group, or shock group, lifted the finger when they received 
a shock applied directly to that finger. One voluntary group, the light group, merely 
was required to lift the finger as rapidly as possible when a light flashed on; the other, 
the light-threat group, was also directed to lift the finger in response to a light, but, 
in addition, these subjects were told that if they did not respond with sufficient rapidity, 
they would receive a shock from a pair of electrodes which were strapped to the other 
hand. This, however, was only an idle threat, and the subjects in this group were 
never shocked. 

The instructions were given to the subjects after they were strapped into the 
apparatus, and in general these directions emphasized speed in responding to the light 
for the voluntary groups, and, for all groups, emphasized the maintenance of a passive 
attitude toward the experiment. 

¹ This piece of apparatus, which is called a continuously variable interval timer,
was used through the courtesy of Dr. Samuel Renshaw. It was devised by Dr. Renshaw,
and a fuller description of it will appear later.

² The terms, voluntary and involuntary, are used at this stage only for convenience
of description; a clearer interpretation of the terms is actually one of the purposes of
this experiment and will be attempted in the discussion.

The unconditioned stimulus (shock or light) alone was administered three
times, and immediately after this there were three presentations of the conditioned
stimulus (buzzer) alone. There then followed 150 stimulations of the conditioned and
the unconditioned stimulus together. In addition to these reinforcements, the un- 
conditioned stimulus alone was administered at irregular intervals for a total of six 
stimulations. For all groups, the buzzer sounded for one second, and half a
second after its onset, the unconditioned stimulus—either light or shock—was pre- 
sented, this ceasing with the termination of the conditioned stimulus. The time inter- 
vals between stimulations were the same for all subjects and they varied from five to 
25 seconds. At the completion of the 150 reinforcements, the unconditioned stimulus 
was omitted and the buzzer alone was sounded until there were either ten consecutive 
responses or three consecutive failures to respond. At this point the experimenter 
made the following statement to the subject: 


“Now, I’m going to present the (shock or light) and the buzzer together
some of the time, and sometimes the buzzer alone will be sounded. I want you
to be sure not to respond unless or until the (shock or light) is given.”

Then ten reinforced stimulations were administered, and, following that, the conditioned
stimulus alone was presented five times, whether the subject responded or
not. The performance of the entire experiment required about 50 minutes.
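
The stopping rule for the buzzer-alone series described in the procedure (ten consecutive responses or three consecutive failures to respond) can be sketched as follows. The function name and the encoding of each trial as a boolean are assumptions made for illustration; the text specifies only the two criteria.

```python
from typing import Iterable, Tuple


def run_buzzer_alone_series(
    responded: Iterable[bool],
    stop_after_responses: int = 10,  # ten consecutive responses (from the text)
    stop_after_failures: int = 3,    # three consecutive failures (from the text)
) -> Tuple[int, int, str]:
    """Apply the stopping rule to a sequence of buzzer-alone trials.

    `responded` yields True when the subject lifted the finger to the
    buzzer alone. Returns (trials given, responses made, reason for stopping).
    """
    trials = responses = 0
    response_run = failure_run = 0
    for lifted in responded:
        trials += 1
        if lifted:
            responses += 1
            response_run += 1
            failure_run = 0
        else:
            failure_run += 1
            response_run = 0
        if response_run >= stop_after_responses:
            return trials, responses, "consecutive responses"
        if failure_run >= stop_after_failures:
            return trials, responses, "consecutive failures"
    return trials, responses, "sequence exhausted"


if __name__ == "__main__":
    # Hypothetical subject: responds on trials 1, 2 and 4, then stops responding.
    print(run_buzzer_alone_series([True, True, False, True, False, False, False]))
```

The count of responses returned by such a rule corresponds to the "number of responses to the buzzer alone" that the results treat as a measure of strength of conditioning.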

Subjects.—The subjects used in the experiment were volunteers taken from the 
elementary Psychology classes at Ohio State University. The age range was from 
17 to 25 years, and the subjects were about equally divided as to sex. There were 18
in the Shock group, 19 in the Light group, and 17 in the Light-threat group. 


RESULTS 
1. Acquisition of the Conditioned Response 


Table I and the graph in Fig. 1 show the percentage of 
times anticipatory conditioned responses occurred for each 
tenth of the conditioning series of 150 stimulations. Since 
there are 15 stimulations for each tenth and 17, 18, and 19 
subjects in our three groups, 100 percent would represent a 
total of 255, 270, and 285 stimulations respectively. It was 
considered that an anticipatory conditioned response occurred 
if the subjects responded before the unconditioned stimulus 
occurred or within 50 milliseconds after its occurrence. The 
figure of 50 milliseconds was chosen after it had been shown 
that the reaction time to the unconditioned stimulus alone 
was never less than 100 milliseconds; thus, it is safe to assume 
that a reaction of 50 milliseconds or less must be a reaction to 
the conditioned and not to the unconditioned stimulus. 
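
The scoring rule and the denominators just described can be sketched as follows. The half-second interval between buzzer onset and the unconditioned stimulus is taken from the procedure; the names, and the convention of measuring latency from buzzer onset, are assumptions made for illustration.

```python
TRIALS_PER_TENTH = 15          # 150 paired stimulations divided into tenths
US_ONSET_MS = 500              # unconditioned stimulus follows the buzzer by half a second
ANTICIPATORY_CUTOFF_MS = 50    # criterion stated in the text

# Group sizes as given in the Subjects paragraph.
GROUP_SIZES = {"Light-threat": 17, "Shock": 18, "Light": 19}


def is_anticipatory(latency_from_buzzer_ms: float) -> bool:
    """True if a response began before the unconditioned stimulus,
    or within 50 milliseconds after its onset."""
    return latency_from_buzzer_ms <= US_ONSET_MS + ANTICIPATORY_CUTOFF_MS


def stimulations_per_tenth(n_subjects: int) -> int:
    """Number of stimulations that would represent 100 percent in one tenth."""
    return TRIALS_PER_TENTH * n_subjects


def percent_frequency(n_conditioned: int, n_subjects: int) -> float:
    """Percentage of stimulations in one tenth that drew a conditioned response."""
    return 100.0 * n_conditioned / stimulations_per_tenth(n_subjects)


if __name__ == "__main__":
    for group, n in GROUP_SIZES.items():
        print(group, stimulations_per_tenth(n))
```

The three denominators produced this way are the 255, 270, and 285 stimulations mentioned above.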

TABLE I

SHOWING THE PERCENT FREQUENCY OF CONDITIONED RESPONSES PER TENTHS OF
THE CONDITIONING SERIES

                         Tenths of the Conditioning Series
Groups             1     2     3     4     5     6     7     8     9    10   Average   S.D.
                                                                             percent
Shock           29.0  50.3  50.0  59.6  62.3  59.6  59.6  64.3  60.6  56.0      51.7   35.2
Light-Threat     8.3   7.0   6.9   6.4   8.5  11.9  13.9  20.8  20.1  19.6      12.6   13.2
Light            3.9   4.2   6.0  10.9  12.9   7.7  11.0  16.7  20.3   [?]       6.7    7.2

Fig. 1. A graph of Table I, showing the percentage frequency of conditioned responses
per tenths of the conditioning series.

An inspection of Fig. 1, the curve drawn from the results
shown in Table I, indicates that there is a decided difference
between the involuntary and voluntary groups with respect
to the curve of the acquisition of the conditioned response.
The curve for the involuntary or shock group is fairly regular
and is negatively accelerated.³ In its general form it is quite
similar to the curve for the acquisition of the wink reflex
reported by Hilgard (2). In rather marked contrast to this
curve are those showing the acquisition of the voluntary
conditioned response. In neither of the curves is there as much
regularity in the trend, and in the light group in particular
there is a large amount of fluctuation. Despite its fluctuations,
however, this curve does show a general trend of positive
acceleration. The somewhat less variable curve for the
light-threat group may, with a slight stretch of the imagination,
be thought of as S shaped, since it reaches a percentage
of approximately 20 at the eighth unit of the series and maintains
that level for the last two tenths. The study of the
form of the curves of acquisition for our three groups, then,
suggests a definite difference between our voluntary and our
involuntary groups. Whether or not there is a difference
between our two voluntary groups it is more difficult to
determine, but it is not unlikely that this is faintly suggested
by these data.

³ Of incidental interest is the falling off of the curve in the ninth and tenth units.
It is very likely that this is a real phenomenon and not an artifact of the experimental
procedure. Two possible explanations for it may be offered. It may be due to fatigue
or growing discomfort; or, on the other hand, it may be the result of the conditioning
phenomenon which Hovland (4) has called inhibition of reinforcement. In the present
experiment there is no way of determining which of these factors may be responsible.

It is significant that, for both of our voluntary groups, 
there is a trend toward an increasing number of conditioned 
responses with increased training. This is in marked contrast
to the findings of Marquis and Porter, who found no
increase in voluntary conditioned wink reactions, and to those 
of Yacorzynski and Guthrie (11), who, using voluntary hand 
withdrawal to a shock on the other hand, found an actual 
decrease in the number of voluntary conditioned responses. 
One possible explanation of this may lie in the fact that these
experimenters used only about a third as many stimulations
per sitting as were used in the present experiment. It
will be noted that the real increase in frequency in our data 
occurs after 60 stimulations, and, if the experiment had not 
continued beyond that point, our conclusions would have 
been in accord with theirs. In addition to this explanation, 
we may accept the one made by Yacorzynski and Guthrie 
that their occasional presentation of their conditioned stimu- 
lus alone during the training served to develop inhibitory 
tendencies. This procedure was not followed in the present 
experiment or in that of Marquis and Porter, so this extra
difference may account for the fact that the three different
experiments have somewhat different results. 

2. Descriptive Features of the Conditioned Response 

(a) Percentage Frequency of Response.—The last column 
in Table I gives the total percentage frequency of response 
and the standard deviation for each of our groups. Here 
again differences between the voluntary and the involuntary 
groups evidence themselves. The percentage of response of 
51.7 which our shock group gives is significantly different 
from the percentage of 6.7 given by the light group, and from 
the 12.6 percent given by the light-threat group, the critical 
ratios of these differences being 5.40 and 4.73 respectively. 
The light-threat group shows a tendency toward greater con- 
ditioning than does the light alone group, the response fre- 
quency of the former being nearly double that of the latter; 
this difference, however, is not significant. 
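
The text does not state how the critical ratios were computed. A minimal sketch, assuming the conventional form (difference between two means divided by the standard error of that difference) and assuming that the S.D. column of Table I is the standard deviation of individual subjects' percentages, is given below. The function name is illustrative.

```python
import math


def critical_ratio(mean_a: float, sd_a: float, n_a: int,
                   mean_b: float, sd_b: float, n_b: int) -> float:
    """Difference between two means divided by the standard error of the
    difference (assumed formula; independent groups)."""
    se_difference = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_a - mean_b) / se_difference


# Averages, standard deviations and group sizes as reported in the text.
SHOCK = (51.7, 35.2, 18)
LIGHT_THREAT = (12.6, 13.2, 17)
LIGHT = (6.7, 7.2, 19)

if __name__ == "__main__":
    print("shock vs light:        ", round(critical_ratio(*SHOCK, *LIGHT), 2))
    print("shock vs light-threat: ", round(critical_ratio(*SHOCK, *LIGHT_THREAT), 2))
    print("light-threat vs light: ", round(critical_ratio(*LIGHT_THREAT, *LIGHT), 2))
```

Under these assumptions the two ratios involving the shock group come out near 5.3 and 4.4, of the same order as the reported 5.40 and 4.73 but not identical to them, so the original computation presumably differed in some detail. The ratio between the two voluntary groups falls well below three, in keeping with the statement that their difference is not significant.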

(b) The Relationship between the Frequency and the Latency
of the Conditioned Response.—The rank order correlations
obtained between the average latency of the conditioned response
and the frequency of the conditioned response are
-.71 for the shock group; .01 for the light group; and .34
for the light-threat group. Since this last mentioned coefficient
is not even over three times its probable error, it probably
should be considered in about the same light as the
coefficient for our light group. The coefficient for the shock
group is well over four times its probable error, and it corresponds
very closely with the correlation of -.77 that Marquis
and Porter obtain with their involuntary group. Here again
a disparity between the voluntary and the involuntary groups 
becomes apparent, for there seems to be no relationship be- 
tween the latency and the frequency of the voluntary con- 
ditioned response, while a fairly high relationship between 
these measures seems to exist in the case of the involuntary 
group. 
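
A rank order correlation and its probable error can be sketched as below. The probable-error formula used here, 0.6745(1 - ρ²)/√N, is the classical approximation for a correlation coefficient and is an assumption, as is the use of the full group sizes for N; the text reports neither the formula nor the number of subjects entering each correlation.

```python
import math
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1 upward, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def rank_order_correlation(x: Sequence[float], y: Sequence[float]) -> float:
    """Rank order (Spearman) correlation: product-moment correlation of ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return covariance / math.sqrt(var_x * var_y)


def probable_error(rho: float, n: int) -> float:
    """Classical probable error of a correlation coefficient (assumed formula)."""
    return 0.6745 * (1 - rho ** 2) / math.sqrt(n)


def times_probable_error(rho: float, n: int) -> float:
    """How many probable errors the coefficient lies from zero."""
    return abs(rho) / probable_error(rho, n)


if __name__ == "__main__":
    print(round(times_probable_error(-0.71, 18), 1))  # shock group, N assumed 18
    print(round(times_probable_error(0.34, 17), 1))   # light-threat group, N assumed 17
```

With these assumed values the shock coefficient lies well beyond four probable errors and the light-threat coefficient falls short of three, which agrees with the interpretation given in the text.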

(c) The Form of the Conditioned Response.—All the responses
made by the subjects were classified into four different
types according to their form. This classification was, of
course, rather subjective, and some errors must necessarily
have crept in. In Fig. 2 representative examples of each of
these four types are given. Type A consisted of a rapid rise
of the finger from its rest position to its highest response level
and the approximate maintenance of that level for about two
seconds before its return to the resting position. Type B
showed the general recruitment phenomena. In this case,
the finger did not rise suddenly, but showed a slight initial
movement or a gradual build-up to the full response to the
unconditioned stimulus. Quite different from either of these
two type responses was Type C, showing a sharp and sudden
ascent of the finger followed immediately by an equally sharp
and sudden descent. The last type was a bi-modal response
which occurred so seldom and was so ambiguous in its other
characteristics that it does not warrant description.

Fig. 2. Showing examples of the characteristic forms of the responses used in the
classification (panels: Type A, Type B, Type C). The response starts at the left.

The percentage of conditioned responses only, falling into
these various classifications, will be found in Table II. From
the study of these results, two significant relationships seem
to stand out. First, there appears to be a characteristic type
of response for the voluntary group and another for the involuntary
group. Type A occurs in the shock group with a
percentage frequency of 74.4, in the light-threat group with
a percentage of 45.7, and in the light group with a percentage
of 33.7. The critical ratio between the shock group and any
of the other groups is significant, being in all cases over 9.
The critical ratio between the light-threat group and the light
group tends toward significance, it being 2.55. The reverse
of this occurs in the Type C responses, where the light group
leads with a percentage frequency of 43.79, the light-threat
group is again intermediate with a percentage of 32.76, while
the shock group is last with 2.24 percent. Again the critical
ratio between the shock group and either of the other groups
is above 9, while the critical ratio between light-threat and
light is 2.29. The other interesting fact is that our Type B
or recruitment response is present in all three groups, but
with almost equal frequency. It is no more characteristic
of one group than of another, the percentages for the shock,
light-threat, and light groups being respectively 22.36, 16.55,
and 21.89 with the critical ratios between groups being in all
cases less than one.

TABLE II

SHOWING THE PERCENTAGE OF CONDITIONED RESPONSES FALLING INTO EACH OF THE
CLASSIFIED FORMS OF RESPONSE

Group           Type A   Type B   Type C   Bi-modal
Shock            74.44    22.36     2.24       .96
Light-Threat     45.70    16.55    32.76      4.98
Light            33.72    21.89    43.79       .59
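
As a simple consistency check on Table II, the sketch below confirms that each group's four percentages account for all of its conditioned responses and picks out the predominant form in each group. The dictionary layout and function names are illustrative assumptions; the figures are those of the table.

```python
FORMS = ("Type A", "Type B", "Type C", "Bi-modal")

# Percentages of conditioned responses by form, from Table II.
TABLE_II = {
    "Shock": (74.44, 22.36, 2.24, 0.96),
    "Light-threat": (45.70, 16.55, 32.76, 4.98),
    "Light": (33.72, 21.89, 43.79, 0.59),
}


def row_total(group: str) -> float:
    """Sum of the four percentages for one group (about 100 if the row is complete)."""
    return sum(TABLE_II[group])


def predominant_form(group: str) -> str:
    """Form of response with the largest percentage in the given group."""
    percentages = TABLE_II[group]
    return FORMS[percentages.index(max(percentages))]


if __name__ == "__main__":
    for group in TABLE_II:
        print(group, round(row_total(group), 2), predominant_form(group))
```

Each row sums to 100 within rounding. Type A predominates in the shock group and Type C in the light group, while the light-threat group, although still led by Type A, shows a much larger share of Type C than the shock group does.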

In connection with the consideration of the form of the 
response it might be noted that Kellogg (6) found present in 
dogs two types of response which seem to correspond with 
our types A and B. It may be that the Type A response, 
although foreign to reflex reinforcement of the eyelid, is found 
fairly commonly in retraction to shock regardless of the 
species. 

(d) The Strength of Conditioning.—If we consider the number
of responses to the buzzer alone after 150 paired stimulations
(for only those subjects who responded at least once
to the buzzer at this time) notable differences between our
voluntary and involuntary groups appear. These results are
shown in Table III. The mean number of unreinforced elicitations
for the shock group was 5.5 ± 4.0, for the light-threat
group was 1.1 ± .33, and for the light group was 1.6 ± .55.
Again the differences between the shock group and the other
groups are statistically significant, the critical ratios being
above three, and, needless to say, the difference between the
other two groups is not significant. In fact, the distribution
of the responses for these two groups is highly similar, there
being only one subject in each of these groups who gave more
than one unreinforced elicitation.

TABLE III

SHOWING THE NUMBER OF SUBJECTS WHO GAVE AT LEAST ONE RESPONSE TO THE
BUZZER ALONE; THE MEAN NUMBER OF RESPONSES BEFORE
EXTINCTION; AND THE STANDARD DEVIATION

Group            N    Mean Number   Standard Deviation
Shock           15        5.5              4.0
Light-Threat     8        1.1               .33
Light            8        1.6               .55

Considering the responses to the buzzer alone from a 
slightly different angle, we find that the voluntary and the 
involuntary groups differ markedly in the number of subjects 
who gave at least one response to the buzzer alone, the figures 
being 15, 8, and 8 for our shock, light-threat, and light groups, 
respectively.

(e) Inhibitability.—After directions had been given to in- 
hibit the conditioned response, 10 reinforcements were pre- 
sented, and it was found that 66.6 percent of the shock group 
continued to make conditioned responses, while 23.3 percent 
of the light-threat group and 26.3 percent of the light group 
continued making them. Again, differences between the in- 
voluntary and the voluntary groups are statistically signifi- 
cant, whereas the difference between the light and the light- 
threat group is insignificant. 


Discussion 


With the data which have been obtained in this experi- 
ment, it becomes possible to discuss with some degree of 
definiteness the validity of the view that the conditioning of 
finger retraction to shock is essentially voluntary condition- 
ing. We have seen in our results that in measure after 
measure the shock or involuntary group differs significantly 
from the two voluntary groups. This difference holds true
whether we consider the frequency of conditioned responses,
the curve of acquisition, the predominant form of the response,
the number of non-reinforced elicitations, or the ability to
inhibit the response. Since it is true that so many differ- 
ences exist between our voluntary and involuntary groups it 
does not appear that one is justified in maintaining that finger 
retraction conditioning to shock is basically voluntary con- 
ditioning. It seems to the writer that the differences are so 
great and so numerous that these two general conditions must 
be differentiated. 

This study does, however, bring up the question as to 
whether or not we are permitted to talk about voluntary and 
involuntary conditioning at all unless we are using these terms 
only for the sake of convenience. Marquis and Porter have 
already mentioned the lack of any categorical distinction 
between the two states, and Peak’s (8) work on reflexes lends 
strong support to this view. It seems to us that the present 
experiment further emphasizes the necessity of thinking of 
voluntary and involuntary as extremes of the same series 
rather than all-or-none phenomena. 

Several lines of evidence from the present experiment may 
be cited in support of this idea. In the first place, it is 
notable that all our groups have much in common. All of 
them have fairly definite curves of acquisition, and in general 
we may say that, up to a certain limit, there is an increase 
in response frequency with increased training. Also, it is 
obvious that conditioning does occur even if it is more fre- 
quent in our involuntary than it is in our voluntary groups. 
Again, all groups give all different forms of response although 
the relative frequency of the types given may vary. Lastly, 
once the response has been established, it is not completely 
inhibitable, although it is less inhibitable in the shock group 
than in the other two groups. 

A second line of evidence comes from our light-threat 
group. In general, we find that this group occupies a some- 
what intermediate position between our shock and our light 
group. In all cases it differs significantly from the shock 
group, and, although there is never a critical ratio of greater 
than three between it and the light group, there is one case 
where the critical ratio between them reaches 2.55. In addition
to this ratio which borders on the accepted one of three,
we must take into consideration the fact that the tendency
for this group to lie between the shock and the light is a 
fairly consistent one. It differs from both groups in the curve 
of acquisition, stands between both groups in percentage fre- 
quency of response as well as in the predominant form of the 
response, and is about equal to the light group in the number 
of unreinforced elicitations and also in inhibitability. We are
probably justified, then, in concluding that behaviorally this 
group is somewhat different from our light or pure voluntary 
group, and, in spite of marked differences from the shock 
group, shows a trend in that direction. 

If we accept this conclusion, and keep in mind the fact 
that there is some similarity between all three of these groups, 
then it makes the problem of voluntary and involuntary conditioning
a simpler one. We may look upon voluntary conditioning
as being built upon a type of unconditioned
stimulus, the response to which is easily influenced by other
stimuli. These stimuli may be external ones, or they may be
such internal ones as implicit speech reactions, but, in either 
case, they may result in inhibition or facilitation of the re- 
sponse. Involuntary conditioning, on the other hand, may 
be conditioning in which the unconditioned stimulus produces 
a response which is not so easily interfered with. This view 
would lead us to expect all different degrees of ‘voluntariness’ 
or ‘involuntariness,’ depending on the unconditioned stimu- 
lus that was employed and also on the response mechanism 
which was involved. We would, for example, expect to ob- 
tain better conditioning to a strong shock than to a weak 
one—a finding which the writer has often observed—just as 
our shock group was better conditioned than our light group. 
We would also expect that certain responses—the eyelid re- 
flex, for example—would give different results in a condition- 
ing experiment than would finger withdrawal from shock, 
since inhibition by other stimuli of the former response may 
be more difficult to produce than is inhibition of the latter 
response. It is obvious that other factors such as time inter- 
vals (10), the intensity and nature of the conditioned stimulus 
(5, 9), or the neural routes involved (3) would play their part, 
but this does not invalidate our view.

Culler (1) has divided the function of the unconditioned
stimulus into two parts, a directional and a motivational one. 
If we consider the strength of the motivation as determined 
by the ease with which other stimuli can inhibit the motivated 
response, our interpretation given above seems the same as
that of Culler. In our experiment, the directional character 
of the unconditioned stimulus was perhaps at a minimum since 
this was mostly supplied—particularly in the case of the light 
and the light-threat groups—by verbal directions given before 
the experiment was begun. The motivation for the light 
group was the rather weak one of pleasing the experimenter 
and doing well in the experiment; in the light-threat group 
there was, in addition to this, the fear of punishment if the 
act was not well performed; while an extra factor for the 
shock group was the rather painful shock. With these vary- 
ing degrees of motivation we found varying degrees of efficacy 
of conditioning. It is not inconceivable that one might be 
able to arrange all the responses which have ever been con- 
ditioned in the laboratory along a scale, according to the 
motivational force of the unconditioned stimulus. This moti- 
vational force, or the ability of the unconditioned stimulus to 
withstand interference, might be found to correlate with the 
behavior in the conditioning situation, and those conditioned 
reactions which have been called ‘voluntary’ and ‘involun- 
tary’ would prove to be only the extremes of the scale. 


SUMMARY 


The purpose of the experiment was to determine the 
validity of the contention that conditioning of finger retrac- 
tion to shock is voluntary conditioning. Three groups of 
subjects were used in a conditioning situation. For one 
group, the unconditioned stimulus was an electric shock ap- 
plied to the finger; for another, it was the flashing on of a 
light, to which the subjects had been previously directed to 
respond by lifting the finger; a third group was instructed to 
respond in the same way to the same signal of a light, but, in 
addition, they were told that they would receive a shock if 
they did not respond quickly enough. The group which received
a shock is termed the involuntary group and the other
two the voluntary groups, for purposes of convenience. 

The results indicate that: (1) There is a considerable dif- 
ference between the involuntary and the voluntary groups in 
the curve of acquisition of the conditioned response, the 
amount of conditioning, the relationship between the latency 
and the frequency of the conditioned response, the predomi- 
nant form of the response, the speed of extinction, and the 
ability to inhibit the response. (2) There was a tendency for 
the light-threat group to occupy a position somewhat between 
the light and the shock groups. (3) In almost all respects, 
all three groups showed some degree of similarity. 

The author concludes that conditioning of finger retraction 
to shock is not voluntary (as that term has been used), and 
further suggests that the dichotomization of conditioning be- 
havior into categories of voluntary and involuntary condition- 
ing is not justifiable. 


(Manuscript received January 16, 1939)


THE EFFECT OF RANDOM ALTERNATION OF
REINFORCEMENT ON THE ACQUISITION
AND EXTINCTION OF CONDITIONED
EYELID REACTIONS¹


BY LLOYD G. HUMPHREYS 
Department of Psychology, Institute of Human Relations, Yale University 


The fundamental concept in conditioning theory is that 
the conditioned response develops with repeated reinforce- 
ment and diminishes (extinguishes) with continued non- 
reinforcement. The extent of either acquisition or extinction 
has been supposed to depend on the number of reinforcements 
or non-reinforcements, respectively. Hull (17, 19) and Spence 
(36), for example, have explicitly assumed that specific in- 
crements of strength of conditioned responses result from each 
reinforcement, specific decrements from each non-reinforce- 
ment. Rational learning curves have been constructed by 
others (9, 10, 37, 39) upon the similar assumption that re- 
peated rewards or frustrations produce corresponding incre- 
ments or decrements. 

Since it is customary to reinforce every trial during ac- 
quisition of conditioned responses, and to omit reinforcement 
entirely during extinction, the increases appear to correspond 
directly to the frequency of reinforcement, the decreases to 
the frequency of non-reinforcement. There are a few frag- 
mentary data, however, which suggest that the relationship of 
acquisition and extinction to the frequency of reinforcement or 
non-reinforcement is not so simple. Pavlov, for example, 
reports experiments on one dog (28, p. 384) in which a condi-
tioned reflex was first developed by the ‘20th application,’


1 This article is from a dissertation submitted to the Department of Psychology 
and the Committee on Graduate Study of Stanford University in partial fulfillment 
of the requirements for the degree of Doctor of Philosophy. The writer wishes to 
express his indebtedness to Professor E. R. Hilgard under whose direction the work 
was done and to thank Professors C. P. Stone and R. T. Ross for their assistance in 
the preparation of the original manuscript. 
which is stated to be about average (28, p. 385). Using a new
conditioned stimulus and reinforcing only one-third of the
trials, the response developed by the ‘7th application,’ but
the animal became ‘extremely excited.’ The norm for sub-
sequent reflexes after the first has been established is from
‘3 to 5’ (28, p. 385).

Skinner (35) has made considerable use of a technique 
which he calls ‘periodic reconditioning’ in which reinforce- 
ment occurs only at regular time intervals, intervening re- 
sponses being non-reinforced. He states that, except at very 
short intervals, the rate of responding is inversely proportional 
to the length of the interval. Periodic reconditioning builds 
up a greater ‘reserve’ than continuous reconditioning. 

By use of another technique, ‘reinforcement at a fixed 
ratio,’ Skinner has obtained as many as 192 responses in rapid 
succession for one reinforcement. Here the relationship 
between frequency of reinforcement and rate of response 
breaks down. Presumably a large reserve is again created. 

Razran (31) makes the statement that continuous rein- 
forcement may decrease the frequency of conditioned re- 
sponses developed to shock and that optimum distributions 
are needed. Such distributions are supposedly especially 
important after some training; i.e., after the reflex has been
‘established.’ In support of this conclusion he states that 
Israelson (24) and Platonov (30) have given many trials 
without reinforcement after the conditioned reflex was once 
established. 

Evidence pertinent to Razran’s conclusion has recently 
been reported by Brogden (1). After establishing three differ- 
ent responses by continuous reinforcement, ‘frequency’ of 
reinforcement was varied. No significant decrements were 
observed when only one-fifth of the trials were reinforced. 

The use of the test-trial procedure in conditioning experi- 
ments results in occasional non-reinforcement. Wolfle (41), 
in a minor part of her study, omitted the 30 test-trials from the 
600 acquisition trials in the first 3 practice periods for one 
group of subjects and compared their acquisition results with 
those from her usual procedure where the test-trials were 
always included. Differences were in favor of the no-test-
trial group, but they were not significant statistically.


Experimental Procedure.—In order to test the effect upon acquisition and extinc- 
tion of reinforcing only one-half of the trials, it was necessary to choose a standardized 
conditioning situation. The eyelid reaction was chosen as being eminently suitable 
(18) and convenient. Three experimental groups were found to be necessary. The 
main experimental group received reinforcement on only half of the trials during 
acquisition, an arrangement to be described as 50 percent reinforcement. Acquisition 
(and subsequent extinction) could then be compared with the results from control 
groups receiving reinforcement each time the conditioned stimulus was presented, 
i.e., 100 percent reinforcement. One of these control groups was given as many total
trials as the experimental group, each trial being reinforced; the other was given as 
many reinforced trials as the experimental group, rest intervals being substituted 
for the non-reinforced presentations of the conditioned stimulus. 

The essential features of the apparatus have been described by Dodge (6) and 
by Hilgard and Campbell (12) except for one modification: lever-recording of eyelid 
movement is used in place of the earlier artificial-eyelash-recording. The eyelid is 
attached to a light wooden lever by a thread. A small mirror mounted on the fulcrum
of the lever reflects light onto the film.2 Several advantages result from this arrange- 
ment: the distracting bright light shining across the line of vision, necessary in the 
other method of recording, is now absent; complete closures are recorded; magnifica- 
tion of response is constant at 1.75 throughout the entire range of movement, ampli- 
tude being read to the nearest millimeter. 

The conditioned stimulus, an increase in illumination of approximately 15 foot- 
candles behind a panel of flash glass, was followed at an interval of approximately 
400 msec. by the unconditioned stimulus, a puff of air to one cornea. After 12 reflex 
adaptation trials had been given, 6 to the puff alone and 6 to the light alone, in that 
order, the three groups of subjects were given the conditions of 100 percent reinforce- 
ment-massed, 50 percent reinforcement, and 100 percent reinforcement-spaced pre- 
viously described. 


Group I—100 percent reinforcement-massed. Twelve reflex adaptation 
trials, 96 paired stimulations of the light and puff, and 24 extinction trials. 

Group II—50 percent reinforcement. Twelve reflex adaptation trials, 48
paired stimulations interspersed at random ³ intervals by 48 non-reinforced trials,
and 24 extinction trials. Otherwise the conditions for this group were exactly 
the same as for Group I. 

Group III—100 percent reinforcement-spaced. Twelve reflex adaptation 
trials, 48 paired stimulations interspersed by rest periods at exactly the same 
intervals as in Group II (in place of non-reinforced trials), and 24 extinction trials. 


Otherwise the conditions for this group were identical with those for groups I 
and II.3 


2 Suggestions for the present form of the apparatus have come from Peak (29), 
Cohen (4), and Rodnick (32). 

3 The order deviated slightly from a random one in that no more than two trials
of any one kind were given in succession, and there were 6 reinforced and 6 non- 
reinforced trials in each group of 12. 
Trials were given every 30 seconds, with a 1-minute rest interval between groups
of 12 trials (6 trials and 6 rest intervals for Group III), each subject serving for an 
experimental session on each of two successive days. Subjects were male and female 
psychology students from elementary classes at Stanford University during the winter 
and spring quarters of 1937-1938. Nine men and 13 women were obtained for each 
group. Standard preliminary instructions were read to all, and introspections were 
obtained at the close of the experiment to routine questions. 
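
The trial orders described above can be illustrated by a minimal sketch. It assumes that the constraint of footnote 3 (6 reinforced and 6 non-reinforced trials in each group of 12, and no more than two of a kind in succession) also held across the boundaries between groups of 12, which the text does not state. The function names, the labels 'R', 'N', and 'rest', and the seed are illustrative and are not taken from the original procedure.

```python
import random


def longest_run(seq):
    """Length of the longest run of identical consecutive items."""
    longest = run = 0
    prev = object()  # sentinel that equals nothing in seq
    for item in seq:
        run = run + 1 if item == prev else 1
        prev = item
        longest = max(longest, run)
    return longest


def make_block(rng, n_each=6, max_run=2, tail=()):
    """Shuffle 6 'R' and 6 'N' trials until no run exceeds max_run.

    `tail` holds the last trials of the preceding block so that the
    run limit is also respected across the join (an assumption).
    """
    while True:
        block = ["R"] * n_each + ["N"] * n_each
        rng.shuffle(block)
        if longest_run(list(tail) + block) <= max_run:
            return block


def group_ii_schedule(n_blocks=8, seed=0):
    """96 acquisition trials: 48 reinforced ('R'), 48 non-reinforced ('N')."""
    rng = random.Random(seed)
    schedule = []
    for _ in range(n_blocks):
        schedule += make_block(rng, tail=schedule[-2:])
    return schedule


def group_iii_schedule(group_ii):
    """Same order as Group II, with rest intervals replacing 'N' trials."""
    return ["rest" if trial == "N" else trial for trial in group_ii]


schedule_i = ["R"] * 96                      # Group I: every trial reinforced
schedule_ii = group_ii_schedule()            # Group II: 50 percent reinforcement
schedule_iii = group_iii_schedule(schedule_ii)  # Group III: spaced reinforcement
```

Deriving the Group III order from the Group II order mirrors the statement that rest periods were given "at exactly the same intervals as in Group II."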


RESULTS 
Initial Comparability of Groups 


Since comparisons depend throughout on the assumption 
that the three groups would have yielded comparable results 
if they had been treated alike, it is important to secure some 
measure of their initial comparability. Previous results (2, 
3, 28), confirmed in this experiment (21), show that there is a 
significant relationship between reflex sensitivity, as measured 
by the reflexes originally present to the experimental stimuli, 
and the strength of conditioned responses which develop. 
The range of the t’s (7) obtained when each group is com-
pared with every other for reflex responses to the light and to
the puff within the initial adaptation trials is from .16 to 1.28, 
and the median is .60. It may thus be inferred that the 
groups were essentially alike in sensitivity, and if treated 
alike would have yielded similar conditioned responses. 


Acquisition 


Table 1 presents data indicating the extent and significance 
of the differences between the three groups of subjects in 
frequency and magnitude ⁴ of conditioned responses during
acquisition.⁵ Frequency shows that groups I and III are most
alike and that groups I and II are least alike, but all of the 
differences could easily have arisen through the operation 
of chance alone. Magnitude indicates that groups I and II 
are most alike, the differences with Group III being somewhat 


4 Trials of zero amplitude are included in magnitude. Although highly correlated
with frequency, more of the variance of magnitude is attributable to amplitude of 
responses than to their frequency (22). 

5 In making these computations subjects were first matched on the bases of reflex 
sensitivity to light and to puff. The mean of the distribution of differences between 
matched individuals was then tested for the significance of its departure from zero. 
TABLE 1


ACQUISITION SCORES FOR TWO PERIODS OF CONDITIONING
(22 Subjects in Each Group)


                                               Frequency: Number of    Magnitude: Millimeters on
                                               Responses over          Record, Including
                                               Total Trials            Zero Responses
                                               M         σ             M         σ
Group I (100 percent reinforcement-massed)     .65       .19           5.97      4.41
Group II (50 percent reinforcement)            .59       .25           5.82      4.50
Group III (100 percent reinforcement-spaced)   .64       .25           6.68      5.28

                                               σ diff.   t             σ diff.   t
Groups I and II                                .06       1.00          1.28      [illegible]
Groups I and III                               .06       .17           1.10      .65
Groups II and III                              .08       .63           1.48      .58


larger, but they are again easily within the range of chance 
expectancy. 

As would be expected from the negligible total score differ- 
ences, the performance curves indicating the course of acquisi- 
tion over the two-day period are strikingly comparable. This 
is exemplified by the curve for frequency presented in Fig. 1. 
All of the acquisition trials are divided into 8 groups for 
facility in presentation. Thus each point represents 12 
reinforced trials for Group I, 6 reinforced and 6 non-reinforced 
trials for Group II, and 6 reinforced trials plus 6 one-half 
minute rest intervals for Group III. The break between 
trials 48-60 and 1-12 indicates the 24-hour interval between
the first and second days. Reflex adaptation trials 1-12 on 
the first day are, of course, omitted. 

The performance of Group II in frequency is somewhat 
lower on the first day than that of either of the other groups. 
By the second day differences are smaller. At no point do 
they approach significance. 

The preceding, presumably chance differences are in strik- 
ing contrast to the differing conditions of reinforcement in the 
three groups; i.e., 96 reinforced trials in Group I, 48 reinforced
and 48 non-reinforced trials in Group II, and 48 reinforced, 
but spaced, trials in Group III. In view of the rapidity of
extinction of the eyelid reflex in man (12, 15, 16), present lack 
of effect from non-reinforcement in Group II is indeed para- 
Fig. 1. Course of acquisition-frequency. The acquisition trials are divided into
8 groups for facility in presentation. Thus each point represents 12 reinforced trials 
for Group I (100 percent reinforcement-massed), 6 reinforced and 6 non-reinforced 
trials for Group II (50 percent reinforcement), and 6 reinforced trials plus 6 one-half 
minute rest intervals for Group III (100 percent reinforcement-spaced). The break 
between trials 48-60 and 1-12 indicates a 24-hour interval between two days of experi- 
mentation. Reflex adaptation trials 1-12 on the first day are omitted. The essential 
comparability of the three groups is easily apparent. 


doxical.6 It is evident that the results are not sufficiently 
accounted for by an algebraic summation of reinforcements 
and non-reinforcements. 


6 There are also implications for distributed practice in the comparison of groups 
I and III. This finding is discussed more fully elsewhere (23).
Extinction


Table 2 presents total score differences in extinction for 
frequency and magnitude.⁷ Conditions during extinction
were identical in all groups so that differences in scores are to 
be attributed to the conditions of reinforcement during 
acquisition. 

The group receiving 50 percent reinforcement is strikingly 
more resistant to extinction than either of the control groups 
(I and III), which are essentially alike. While the differences 


TABLE 2


EXTINCTION SCORES FOR 24 NON-REINFORCED TRIALS
(22 Subjects in Each Group)


                                               Frequency: Number of    Magnitude: Millimeters on
                                               Responses over          Record, Including
                                               Total Trials            Zero Responses
                                               M         σ             M         σ
Group I (100 percent reinforcement-massed)     .31       .24           1.69      1.16
Group II (50 percent reinforcement)            .60       .33           5.35      5.37
Group III (100 percent reinforcement-spaced)   .34       .27           2.50      2.06

                                               σ diff.   t             σ diff.   t
Groups I and II                                .08       3.61          1.06      3.45
Groups I and III                               .08       .38           .43       1.90
Groups II and III                              .08       3.25          1.00      2.84


between groups I and III are minimal, the t values for the
differences in both frequency and magnitude between Group 
II and the other groups suffice to indicate the probability of 
genuine differences. 
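
The t values of Table 2 may be checked approximately from the tabled means. The sketch below assumes that the three rows of differences are ordered I-II, I-III, II-III, and that the first entry of each pair is the standard error of the difference; both are inferences from the arithmetic, since the row labels are not preserved. The function name is illustrative.

```python
def critical_ratio(mean_a, mean_b, se_diff):
    """Difference between two group means over the standard error of that difference."""
    return abs(mean_a - mean_b) / se_diff


# Group means from Table 2 (frequency as a proportion of trials, magnitude in mm).
means = {
    "frequency": {"I": 0.31, "II": 0.60, "III": 0.34},
    "magnitude": {"I": 1.69, "II": 5.35, "III": 2.50},
}

# Standard errors of the differences; the pairing of rows with groups is assumed.
se_diff = {
    "frequency": {("I", "II"): 0.08, ("I", "III"): 0.08, ("II", "III"): 0.08},
    "magnitude": {("I", "II"): 1.06, ("I", "III"): 0.43, ("II", "III"): 1.00},
}

ratios = {
    measure: {
        pair: critical_ratio(means[measure][pair[0]], means[measure][pair[1]], se)
        for pair, se in se_diff[measure].items()
    }
    for measure in means
}

for measure, by_pair in ratios.items():
    for (a, b), value in by_pair.items():
        print(f"{measure:9s} Groups {a} and {b}: {value:.2f}")
```

Small departures from the tabled values are to be expected, since the published means and standard errors are rounded to two places.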

Figure 2 presents the performance curves in extinction 
for frequency. The 24 extinction trials are divided into 4 
groups of 6 trials each. An average for the preceding 24 
acquisition trials is plotted to serve as a reference point for 
the extinction results. The essential comparability of groups 
I and III is easily apparent, while Group II responds at a 


7 For these computations subjects were matched on the basis of performance
during acquisition before following the procedure of footnote 5. 
consistently higher level throughout than either of the others.
A statistical comparison of the points plotted indicates that 
the differences are relatively larger early in the extinction 
series. 


Fig. 2. Course of extinction-frequency. The 24 extinction trials are divided
into 4 groups of 6 trials each. An average for the preceding 24 acquisition trials is 
plotted to serve as a reference point for the extinction results. The essential com- 
parability of groups I and III (100 percent reinforcement) is easily apparent, while 
Group II (50 percent reinforcement) responds at a consistently higher level through- 
out than either of the others. 


It is obvious that comparable performance during ac-
quisition does not lead in all cases to comparable performance
in extinction. Group II, given 50 percent reinforcement in 
acquisition, was somehow enabled to respond at a significantly 
higher frequency and magnitude during extinction than groups 
given 100 percent reinforcement. This differential effect of 
50 percent reinforcement as contrasted to 100 percent rein-
forcement for extinction, in spite of similarities in the course of
acquisition, requires interpretation.


DISCUSSION


The results just presented are definitely contradictory to a 
conditioning theory which posits specific increments of excita- 
tory strength for each reinforcement and specific decre- 
ments for each non-reinforcement. From the theory one 
would predict that Group I should show the greatest amount 
of acquisition; Group III, receiving only one-half the rein- 
forcement, should follow at a slower rate; and Group II, 
also receiving only one-half the reinforcement plus an equal 
number of non-reinforced trials, should show little if any 
acquisition. Contrary to expectations, acquisition is es- 
sentially alike for all groups. According to conventional 
theories, one of two results might be expected within extinc- 
tion. Since Group I had double the number of reinforce- 
ments of groups II and III, it might be expected to extinguish 
most slowly (40, 42). Or, since all groups were essentially 
alike at the end of the reinforcement session, they might all 
be expected to extinguish equally. Both of these expecta- 
tions are contradicted, since Group II, with 48 reinforcements 
and 48 non-reinforcements, and not superior at the end of the 
acquisition period, extinguished by far the slowest. 
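
The prediction of the increment-decrement theory for acquisition can be made concrete with a minimal sketch. It assumes equal unit increments and decrements, a floor of zero strength, and a simple alternation standing in for the random order of Group II; none of these values comes from Hull or Spence, and the function name is illustrative.

```python
def final_strength(trials, gain=1.0, loss=1.0):
    """Algebraic summation of reinforcements and non-reinforcements.

    Each reinforced trial ('R') adds `gain`; each non-reinforced trial
    ('N') subtracts `loss`, with strength never falling below zero.
    Any other label (e.g. 'rest') leaves strength unchanged.
    """
    strength = 0.0
    for trial in trials:
        if trial == "R":
            strength += gain
        elif trial == "N":
            strength = max(0.0, strength - loss)
    return strength


group_i = ["R"] * 96            # 96 reinforced trials
group_ii = ["R", "N"] * 48      # 48 reinforced, 48 non-reinforced (alternation assumed)
group_iii = ["R", "rest"] * 48  # 48 reinforced trials with rest intervals

predicted = {
    "I": final_strength(group_i),
    "II": final_strength(group_ii),
    "III": final_strength(group_iii),
}
print(predicted)
```

Under these assumptions the theory orders the groups I, III, II at the end of acquisition, with Group II showing no net acquisition, whereas the obtained acquisition scores were essentially alike.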

The results are likewise inconsistent with the concept of 
‘excitatory-potential’ recently made explicit by Hull (20). 
He states that ‘‘an excitatory-potential is a central condition 
or state of an organism which, given appropriate circum- 
stances, will result in a specified form of action.” The 
magnitude of the potential is dependent upon numerous 
factors in the conditioning situation and in the situation in 
which the potential later becomes actual. Among these are 
the number of reinforcements, the number of responses in 
extinction, the drive during reinforcement and during extinc- 
tion, etc. The excitatory-potential is then manifested in 
several ways. Among these are amplitude, latency, number 
of frustrations undergone, and time for extinction. Potentials 
of equal strength will always be manifested in exactly the same
way. When it can be shown, however, that three groups of
subjects have excitatory-potentials of approximately equal
strength for the period of acquisition (no significant differences
in frequency and magnitude of responses), but that one of
these (Group II) has an apparently stronger potential for 
extinction, sufficient evidence is available to deny the general 
validity of the hypothesis. 

Skinner (35) differentiates between the ‘reserve’ and the 
‘strength’ of a reflex, the two including the characteristics of 
Hull’s excitatory-potential. Depending on the conditions 
of the experiment, equal strengths may result from unequal 
reserves. The results from the present experiment might thus 
be described in those terms; i.e., conditioned responses in
the three groups are of equal strength at the end of acquisition, 
but a greater reserve has been built up in Group II through 
50 percent reinforcement. Skinner, however, makes the 
traditional assumptions concerning the effects of frequency 
of reinforcement and non-reinforcement, so he would be un- 
able to explain the development of the greater reserve in Group 
II. He would be at a loss, similarly, to explain the results 
from his own technique of ‘periodic reconditioning’ which 
was discussed in the introductory paragraphs of this paper. 


An Alternative Theory 


Since the conventional theory of conditioning is inadequate 
to the experimental data, some alternative is required to 
harmonize the results. A satisfactory alternative theory 
requires a new interpretation of the roles of reinforcement and 
of non-reinforcement. One such alternative is that condi- 
tioned responses are a consequence of anticipated reinforce- 
ment, extinction of anticipated non-reinforcement, and that 
the role of frequency in the repetition of reinforcement or non-
reinforcement is by way of its influence on the subject’s
expectations of the stimuli which are to appear. Hilgard and 
his co-workers (11, 13, 14) have interpreted conditioned dis-
crimination in this way, following the writings of Tolman
and Brunswik (38) and Krechevsky (25, 26).

Acquisition depends on the probability that the light
will be followed by the air-puff. In the cases of groups I and
III the light is uniformly followed by the air-puff, so that the
more trials the higher the probability that the air-puff will 
follow. That the increase in rate within both groups is simi- 
lar, even though one has twice the reinforcements of the 
other, must be attributed to whatever factors cause the 
advantage in spaced over massed practice. This is discussed 
elsewhere (23). To account for the results in Group II a
suggestion from Tolman and Brunswik is in order. They 
suggest that a cue to a noxious stimulus, even though it is 
‘ambiguous,’ is reacted to as ‘dangerous.’ Thus the light is 
reacted to as though it may be followed by the air-puff, even 
though the subject knows very well that it may not be so 
followed. This accounts for the development of the condi- 
tioned responses at normal rate in spite of the interspersed 
non-reinforced trials. 

Extinction, correspondingly, must depend on the expec-
tancy of non-reinforcement. The rapid extinction in groups
I and III must be explained as a shift from the expectation of 
uniform reinforcement to that of uniform non-reinforcement. 
A single non-reinforced trial is a much more impressive ex- 
perience after uniform reinforcement than after many previous 
non-reinforced trials as in Group II. In fact, even after sev-
eral non-reinforced trials, the subjects of Group II may con-
tinue to expect reinforcement, since reinforcement has often 
previously followed non-reinforcement. Hence the resistance 
to extinction is explained. 

These conjectures satisfactorily account for the data, 
provided, of course, that the conjectures may be substantiated. 
Two sets of evidence are available from within the reported 
experiments, first, certain individual trials in extinction, and, 
second, the introspective comments of the subjects. 


Individual Trials in Extinction 
The most significant trials for the ‘expectancy’ interpreta- 
tion lie within the first 6 trials and those immediately preced- 
ing and following the 1-minute rest period in the middle of the 
extinction series. Data for frequency are quite representa-
tive of other measures and are presented here in Fig. 3. The
first point plotted is again the mean from the immediately 
preceding 24 acquisition trials to serve as a reference point 
for the extinction results. Very striking are the rapid drops 
Fig. 3. Individual trials in extinction-frequency. The first 6 trials in extinction
and the trials immediately preceding and following the 1-minute rest interval in the
middle of the series are plotted here for the three groups of subjects. The first point
plotted is again the mean from the preceding 24 acquisition trials to serve as a refer- 
ence point. Very striking are the rapid drops for both groups I and III (100 percent 
reinforcement) and the slight rise on the third trial for Group II (50 percent rein- 
forcement). 


for both groups I and III and the slight rise on the third trial 
for Group II. Both facts fit perfectly the ‘expectancy’ 
hypothesis. That there is also a great deal of uncertainty in 
Group II even after the third trial is indicated by the slowness 
of the drop thereafter (no more than two non-reinforced trials 
had ever occurred together previously). The uniform rise
between the last trial before the rest period and the first trial 
thereafter is also congruent—at the start of a new series 
“anything can happen.” It is interesting to observe that this 
phenomenon is equivalent to that of spontaneous recovery
(28), the recovery in this case taking place during a single
minute.
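
The bearing of the acquisition order on the first extinction trials can be sketched as follows. The block of 12 trials used here is a hypothetical order that merely satisfies the stated constraints (6 reinforced, 6 non-reinforced, no more than two of a kind in succession); the actual orders are not reported. The function tallies how often a trial preceded by exactly k consecutive non-reinforced trials was itself reinforced. It is offered as a toy illustration and not as the author's analysis.

```python
def p_reinforced_after_run(schedule, k):
    """Proportion of trials preceded by exactly k consecutive 'N' trials that were 'R'.

    Returns None when no trial in the schedule was preceded by such a run.
    """
    hits = total = 0
    run = 0  # consecutive 'N' trials immediately before the current trial
    for trial in schedule:
        if run == k:
            total += 1
            hits += trial == "R"
        run = run + 1 if trial == "N" else 0
    return hits / total if total else None


# Hypothetical Group II order: one admissible block of 12, repeated 8 times.
block = list("RNNRRNRNNRNR")
group_ii = block * 8
group_i = ["R"] * 96

for k in (1, 2, 3):
    print(k, p_reinforced_after_run(group_ii, k), p_reinforced_after_run(group_i, k))
```

Because no more than two non-reinforced trials ever occurred together, every run of two in such an order was followed by reinforcement, which accords with the slight rise on the third extinction trial for Group II. For Group I no non-reinforced trial had occurred at all, so acquisition supplied no such basis for continued expectation.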


Introspections 


The introspective comments took the form of answers 
to three questions: (1) Did you notice the change in the last 
two groups of trials? (2) When did you become sure of the 
change? (3) What difference did it make to you? The 
importance of such introspections was not realized until 
after the first few subjects had been used, but all were ques-
tioned thereafter.

When the answers to these questions were analyzed, it 
was found that subjects could be roughly classified into 3 
categories: (1) those who kept on expecting the air-puff 
throughout the 24 extinction trials; (2) those who changed 
their hypothesis more or less rapidly and decided that there 
would not be any more air-puffs; and (3) those who said that 
they were not thinking about the experiment and thus were 
not able to answer the last two questions. This analysis is 
presented in Table 3, including the number of those not
questioned, the number and percentage of the total intro- 
spections that were placed in each of the categories, and the 
performance in acquisition and extinction of those in the 
categories as measured by percentage-frequency of response. 
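
The percentages of introspections appear to be taken on the number of subjects actually questioned in each group, not on the full 22. The following sketch checks this reading against the entries that are legible in Table 3; the function name is illustrative.

```python
GROUP_SIZE = 22  # subjects in each group


def percent_of_introspections(n_in_category, n_not_questioned, group_size=GROUP_SIZE):
    """Percentage of the questioned subjects who fall in one category."""
    questioned = group_size - n_not_questioned
    return round(100.0 * n_in_category / questioned, 1)


# (group, category, N in category, N not questioned, percentage given in Table 3)
legible_entries = [
    ("I", "changed hypothesis", 12, 5, 70.6),
    ("II", "expected puff", 8, 4, 44.4),
    ("II", "changed hypothesis", 5, 4, 27.8),
    ("II", "didn't think", 5, 4, 27.8),
    ("III", "expected puff", 6, 3, 31.6),
    ("III", "didn't think", 1, 3, 5.3),
]

for group, category, n, missing, tabled in legible_entries:
    computed = percent_of_introspections(n, missing)
    print(f"Group {group:3s} {category:20s} computed {computed:5.1f}  tabled {tabled:5.1f}")
```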

Although there are individual exceptions, average fre- 
quency scores indicate that those who kept on expecting the 
air-puff, no matter in which group they are found, tended 
to give more conditioned responses than those who changed 
their hypothesis concerning the air-puff. That this is not a 
function of acquisition level is evident from a comparison of 
the respective percentage-frequencies. It should also be 
noted that there were more who kept expecting the air-puff 
in Group II than in either of the other groups. 
TABLE 3

ANALYSIS OF INTROSPECTIONS


                                                        Percent of        Percentage-frequency
                                                 N      Introspections    Acq.      Ext.

Group I (100 percent reinforcement-massed)
  No introspections.........................     5
  Expected puff for 24 extinction trials....
  Changed hypothesis during extinction......    12      70.6              65        28
  Didn’t think about experiment.............

Group II (50 percent reinforcement)
  No introspections.........................     4
  Expected puff for 24 extinction trials....     8      44.4              62        66
  Changed hypothesis during extinction......     5      27.8              50        35
  Didn’t think about experiment.............     5      27.8              57        73

Group III (100 percent reinforcement-spaced)
  No introspections.........................     3
  Expected puff for 24 extinction trials....     6      31.6              66        55
  Changed hypothesis during extinction......
  Didn’t think about experiment.............     1       5.3              68         8


There are, however, two possible anomalous findings in the 
introspection results. Firstly, although there is the difference 
between those who expect the puff and those who do not, 
irrespective of group, Group II still responds at higher levels 
in each of the categories than do the other groups. This 
may be merely a ‘chance’ difference, or it may be due to 
differing degrees of expectancy not taken into account in the 
broad categories used. Secondly, with the exception of the 
single case in Group III, those who ‘didn’t think’ performed 
much more like those who kept on expecting the puff than 
those who didn’t. More information is needed to clarify 
this relationship. 
Is Expectancy Purely Verbal?


It might be supposed that the results reported for condi- 
tioned eyelid reactions merely reflect the verbalizing ability 
of human subjects, so that, in fact, eyelid reactions are un- 
suitable for conditioning experiments. While verbal abilities 
undoubtedly influence the results (14), they are not the sole 
explanation (13, 16). It has also been shown that expectancy 
of stimulating situations is not the same as preparation for 
specific acts (15); i.e., conditioned eyelid reactions in human
subjects are not under full voluntary control. 

Parallel results may be anticipated from experiments with 
lower animals. The results of Pavlov and Brogden, previ-
ously mentioned, are similar to those presented here. Skin-
ner’s results from periodic reconditioning and reinforcement
at a fixed ratio can best be understood in accordance with
expectancy. A phenomenon comparable to the verbal condi-
tioning of the psycho-galvanic response (5, 27) has been ob-
served in lower animals, e.g., the conditioning without paired
stimulation of Grether (8) and Sears (34). It is suggested 
that generalization is not a radically different phenomenon, 
the criterion of differentiation being merely the presence or 
absence of previous pairing of stimuli. This is strikingly 
evident in the cases of extremely wide generalization, e.g.,
from buzzer to light or vice-versa as in Schlosberg’s results 
(33). Hence it cannot be assumed, without proof, that the 
results as reported here would not also apply to non-verbaliz- 
ing animals. 


SUMMARY AND CONCLUSIONS 


In order to test the effects of random alternation of 
reinforcement on the acquisition and extinction of conditioned 
eyelid reactions, 66 college students were conditioned to a 
light followed by a puff of air to the cornea. This training 
occurred under three differing conditions, subjects being 
grouped according to the procedure used. Group I was given 
100 percent reinforcement for 96 trials, the light always being 
followed by the puff, followed by 24 extinction trials. Group 
II was given 50 percent reinforcement for 96 trials, the light 
being followed by the puff on only one-half of the trials,
followed by 24 extinction trials. Group III was given 100
percent reinforcement for 48 trials, with 48 interspersed rest 
intervals where non-reinforced trials occurred in Group II, 
followed by 24 extinction trials. The findings justify the 
following conclusions: 

1. There are no significant differences in the acquisition 
of conditioned responses, measured by frequency and magni- 
tude, under the three experimental procedures. This is clear 
evidence that non-reinforced trials do not always result in 
significant decrements (Group II). It also points to an ad- 
vantage of spaced trials over an equivalent number of massed 
trials (compare groups I and III). 

2. There are significant differences in the extinction of the 
conditioned responses, measured by frequency and magnitude, 
between Group II (50 percent reinforcement) and groups I
and III (100 percent reinforcement). Group II responds at a 
significantly higher level over the 24 extinction trials than 
groups I and III. 

3. Neither the acquisition nor the extinction results are in 
harmony with classical conditioning theory which stresses 
merely the frequency with which reinforcement or non-rein- 
forcement occurs. An alternative interpretation, provision- 
ally designated the expectancy hypothesis, is coherent with 
all of the data within the reported experiment. 


(Manuscript received January 18, 1939) 
I. INTRODUCTION


Examination of reproductions of visually perceived forms 
has been the basis of much theoretical speculation concerning 
memory trace. Experiments in this field have given some 
excellent qualitative and quantitative descriptions of changes 
which occur in reproduction when compared with the stimulus 
figure. Unfortunately inferences of changes in reproduction 
to changes in the trace have been quite confusing. This is 
in part due to the limitations of reproduction as a method for 
studying changes in the trace and in part from a lack of under- 
standing of the process of reproduction. The chief aim of this 
experiment is to throw additional light upon the process of 
reproduction. An attempt will also be made to tie up the 
results with the general problem of memory trace. 

Recent experiments have improved methods of investiga- 
tion and have clarified many issues concerning the trace. 
We can no longer assume that a change in reproduction is a 
direct representation of an identical change in the trace. 
Hanawalt (4) in a recent monograph has shown that some 
change which was previously attributed to the trace is present 
in copies made by the Os of the stimulus material. Hence this 
much of the change must be attributed either to the mechanics 
of drawing or to the process of perception. In the same study 
it was demonstrated by the methods of single reproduction 
and recognition that much progressive change previously 
attributed to the trace was an artifact of the method of successive
reproduction. Woodworth, in his presidential address
before the A.P.A. (9, 13 ff.), formulates the reaction theory 
of perception and interprets its implications concerning recall. 
He argues to the effect that our recall is limited by the reac- 
tions made during the learning period. Wulf (11), Gibson 
(3) and others have shown that reproduction of a figure is 
dependent to a great extent upon the manner of apprehension. 
Carmichael et al. (2), following the suggestion of Gibson, 
have demonstrated the possibility of determining the direction 
of change in the reproduction by giving a verbal suggestion 
in the learning period. Zangwill (12, 13 ff.), in connection 
with another problem, gives a beautiful demonstration of the 
influence of different mental sets upon the reproduction of the 
same figure. Kuhlmann (6), Bartlett (1) and others have 
emphasized the fact that reproduction is a construction and 
hence only partially dependent upon the trace.¹ The recognition
method, the relearning method, and the demonstration
of reminiscence show that much is present in the trace which is 
not included in the reproduction. 

In the monograph referred to above, the writer noted 
several cases where a change in verbalization in delayed recall 
is accompanied by changes in the reproduced figure which 
seem to be the result of this changed verbalization. For 
example, a figure associated with an arrow-head in immediate
reproduction is remembered as a spear-head after eight weeks 
and a part of the shaft of the spear is introduced into the 
reproduction. 

The present experiments are designed to control experi- 
mentally the verbalizations of the recall period just as Car- 
michael et al. attempted such control in the learning period. 
Their figures and a modification of their method are used. 
They presented a list of twelve ambiguous figures to each of 
three groups of subjects. For example, to Group I they 
announced “the next figure will resemble a seven.” To
Group II, using the same form, they announced “the next
figure will resemble a four.” To a third group, the control
group, they gave no verbal suggestion. Out of the 905
reproductions of Quality Step Five, 74 percent in List I and
73 percent in List II were like what may be termed ‘the visual
representation of the figure named,’ compared with 45 percent
like either of the two lists in the control group.

1 For a further discussion of this point the reader is referred to Woodworth, R. S.,
Experimental Psychology, 1938, p. 77 ff. Chapter IV of this book gives a very complete
history and a critical survey of this whole problem of memory for form.


II. THE PRESENT PROBLEM, MATERIAL, AND METHOD


If verbalization during the process of reproduction is a 
factor in determining the form which the reproduction takes, 
it should be possible to induce these changes experimentally. 
The general plan of the experiment is to present the figures 
to the Os for study. After a period of time the O is told to 
reproduce the figure which looked like a certain object. The 
detailed plan of the experiment will be brought out in the 
instructions to subjects. The forms (see Chart I) were photo- 
graphed individually and presented to the Os on a screen for 
group participation. The time of exposure was 10 sec. for
each figure. There was a 10 sec. interval between successive presentations.
Each of the twelve ambiguous figures was shown
once to each subject. The Os made only one reproduction 
of each figure. 

Subjects—All of the 148 students taking elementary psychology at New Jersey 
College for Women were used as subjects. There were three major groups: Suggestion 
Group I, Suggestion Group II, and the non-suggestion group or Control Group. Each 
of these three groups was subdivided into three groups for different intervals of recall— 
immediate, after two days, and after one week. 

Instructions to Subjects—All subjects had the same instructions for the learning 
period: 

A series of twelve figures will be flashed upon the screen. Each figure will be
exposed for ten seconds. You are to study these figures carefully and observe
their construction.


They were not informed that they would be called upon for reproduction but many 
reported that they suspected it. 


Instructions for Suggestion Group I and Suggestion Group II in recall period: 


You are going to reproduce as accurately as possible the series of figures in the 
order that they were presented on the screen. You will be given suggestions to 
enable you to recall the figures but draw them as nearly like the figures on the 
screen as possible. If you do not remember a figure leave a blank space. Please 
do not draw a figure until that particular figure is called for. This is very important.
Everyone will be drawing the same figure at the same time. Listen,
I shall repeat the instructions. ... Are the directions clear? (Pause) (To
Suggestion Group I) Draw the figure which resembled ‘curtains in a window.’
(To Suggestion Group II, in reference to the same figure) Draw the figure
which resembled ‘a diamond in a rectangle.’

[Chart caption: Examples of types of change judged to be due to verbal suggestions.
In each case, the learning association, reported by the subject after recall, is
different from the suggestion given in the recall period.]


For the two lists of suggestions for the twelve ambiguous figures, see the printing 
under the two groups of reproductions in Chart I.

Instructions for Control Group in recall period:


You are going to reproduce as accurately as possible the series of figures presented 
on the screen. They do not necessarily have to be in the order presented. 


After the subjects had completed their drawings, and as far as they were concerned 
had completed the experiment, they were instructed as follows: 


Beside each figure you have reproduced, indicate what object or experience, if 
any, with which you had associated the figure while you were studying it on the 
screen. Do not erase or change any of the figures. 


To Suggestion Group I and Suggestion Group II the following was added to the above: 


Indicate also whether you remembered the figure presented or drew a figure called 
to mind by the suggestion given by the experimenter. 


III. RESULTS


1. Results Based upon the Judgments of Three Judges 


Following the procedure of Carmichael et al. (2, p. 78), 
one of the experimenters and a judge unacquainted with the 
theoretical background and purpose of the experiment judged 
independently each of the reproductions. They were as- 
signed to one of the categories in Table I. A third judge, 
the other experimenter, disposed of the cases where there 
was disagreement.² Although our chief concern is with the
effect of verbal suggestion in recall, other classifications
are included in order to throw light on the fate of all the
reproductions. The fact that the subjects in over 90 percent
of the cases report that they had associated the figure with an 
object in the learning period is a handicap in an experiment 
of this sort. However, in spite of this learning association, a 
considerable percentage of the reproductions show the effect 


2 At first the problem of judgment seemed like an impossible task. It was not 
until after considerable practice on some preliminary material, a careful study of the 
drawings, and an understanding among the judges on standards of judgment that we 
were able to proceed to the main results. The agreement between the two judges 
was far above our expectation. The second section of the results will give some pos- 
sible cues to use in judging. 
TABLE I
CLASSIFICATION OF THE REPRODUCTIONS BASED UPON THE JUDGMENT OF THREE JUDGES

                      Classification of Reproductions*
Recall            Like     Learning    Like     No       Change     No. of   Av. No.
Period       N    Learn-   Assn. and   Sugg.    Change   Unac-      Draw-    per
                  ing      Sugg.       in       Noted    counted    ings     Subj.
                  Assn.    Same        Recall            for
                  %        %           %        %        %

Control Group—No Suggestion
Immediate    14   63       —           —        18       18         164      11.7
2 days       14   75       —           —        11       13         126       9.0
7 days       18   93       —           —         1        6         107       5.9

Suggestion in Recall Period—List I
Immediate    14   24       36          12       13       16         158      11.3
2 days       17   34       30          19        6       11         193      11.3
7 days       21   18       38          33        2       10         200       9.5

Suggestion in Recall Period—List II
Immediate    15   33       13          23       22       10         172      11.5
2 days       17   40       18          29        8        5         170      10.0
7 days       18   25       23          45        1        5         158       8.8

* Like Learning Association means that the judges were able to detect a change in
reproduction which seemed to be due to the learning association that the subject
reported, after recall, to have been present in the learning period.

Learning Association and Suggestion Same means that the same word was sug- 
gested in the recall period as was reported by the subject to have been present in the 
learning period. In these cases it was not possible to separate the two factors. 

Like Suggestion in Recall means that the judges were able to detect in the repro- 
duction some evidence which they believed to be due to the verbal suggestion given 
in the recall period. 

No Change Noted means that the reproduction looked so much like the stimulus 
figure that it was considered a correct reproduction. 

Change Unaccounted for means that the change was of such a nature that it could 
be attributed neither to the learning association nor to the suggestion given in the 
recall period. 


of the suggestion in recall (see Table I, column Like Suggestion 
in Recall or Graph I, bottom curve). 

Indirect corroboration of Carmichael et al. is apparent
in the results of Table I. This is clearly shown for the Control
Group in column Like Learning Association or graphically in
Graph I, top curve. These results, however, go beyond
those of Carmichael et al. They suggest that had these authors
used delayed recall, they would have found an
increased frequency of the verbal effect with delay in reproduction.
Examination of the individual figures reveals that
the degree of change, as well as the frequency, increases in
delayed reproduction. The effect of the learning association
is also evident in the Suggestion Groups to a lesser degree.
Under this condition we note that the effect of the learning
association rises more slowly after two days and actually
decreases after seven days. Apparently the suggestion in
recall is making inroads upon the effect of the learning association.
This effect increases with delayed recall.

GRAPH I. The effect upon reproduction of the learning association compared
with that of suggestion in recall.
Top curve: Percent of reproductions in Control Group showing some modification which
was judged to be due to the learning association.
Middle curve: Percent of the reproductions of the suggestion-in-recall groups showing the
effect of the learning association in reproduction (combination of the first
two columns under Classification of Table I for the suggestion groups). The
results of Suggestion Lists I and II are averaged.
Bottom curve: Percent of reproductions showing the effect of suggestion in recall. The
results of Suggestion Lists I and II are averaged.
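The three curves of Graph I can be recomputed from the percentages of Table I. The following is only an illustrative sketch: the variable and function names are hypothetical, and the recall-period labels are inferred from the description of the subject groups (immediate, two days, one week).

```python
# Percentages transcribed from Table I. The period labels are an assumption
# based on the three recall intervals named in the Subjects paragraph.
PERIODS = ["immediate", "2 days", "7 days"]

# Control Group, column "Like Learning Association".
CONTROL_LIKE_LEARNING = [63, 75, 93]

# Per period: (Like Learning Assn., Learning Assn. and Sugg. Same,
#              Like Sugg. in Recall)
LIST_I = [(24, 36, 12), (34, 30, 19), (18, 38, 33)]
LIST_II = [(33, 13, 23), (40, 18, 29), (25, 23, 45)]


def graph_one_curves():
    """Return {period: (control, learning_effect, suggestion_effect)}.

    learning_effect combines the first two classification columns and
    averages Lists I and II; suggestion_effect averages the third column.
    """
    curves = {}
    for i, period in enumerate(PERIODS):
        like_1, same_1, sugg_1 = LIST_I[i]
        like_2, same_2, sugg_2 = LIST_II[i]
        learning_effect = ((like_1 + same_1) + (like_2 + same_2)) / 2
        suggestion_effect = (sugg_1 + sugg_2) / 2
        curves[period] = (CONTROL_LIKE_LEARNING[i], learning_effect, suggestion_effect)
    return curves


if __name__ == "__main__":
    for period, values in graph_one_curves().items():
        print(period, values)
```

Computed this way, the learning-association curve for the suggestion groups runs 53, 61, 52 percent and the suggestion-in-recall curve runs 17.5, 24, 39 percent, which is the rise and later fall described in the text.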

In Graph I we have combined the columns Like Learning
Association and Learning Association and Suggestion Same
and have averaged the percentages for Suggestion Groups I
and II to show the trend of the effect of the learning associations
under this condition. We have also averaged the
percentages of Conditions I and II in the bottom curve of 
this graph, for although there are differences in absolute 
percent, the slope of the curves is approximately the same. 
This difference in size of percentages is due to the fact that 
more of the learning associations are like List I. Table II 
gives the distribution of the learning associations for the con- 
trol group. 

TABLE II
LEARNING ASSOCIATIONS OF CONTROL GROUP
(n = 484)

Learning Associations          Percent
Like Suggestion List I           39
Like Suggestion List II          22
No association                    6


The column Change Unaccounted for in Table I includes 
figures for which no learning association was reported. It 
also includes figures which exhibit a change that can be ac- 
counted for neither by the learning association nor by the 
suggestion given in recall. The number of figures which show 
no change decreases rapidly from immediate recall to recall 
after seven days. 

The average number of figures recalled per subject is 
much greater for the Suggestion Groups than for the non- 
suggestion group (Control Group). From the averages in 
Table I, and also from the introspections of the subjects, it 
would appear that the suggestions in some cases have an 
inhibiting effect. However, for delayed recall the suggestions 
far outweigh in advantage any inhibiting effect they may have 
had. This is further evidence that some apparent forgetting 
in reproduction is due to unavailability of the trace rather 
than to complete disappearance of the trace. 
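The averages referred to here follow from dividing the number of drawings by the number of subjects in each cell of Table I. A minimal sketch of that arithmetic is given below; the names are hypothetical, the period labels are inferred as before, and rounding to one decimal is an assumption about how the printed column was prepared.

```python
# (group, recall period, number of subjects, number of drawings) from Table I.
TABLE_I_COUNTS = [
    ("Control", "immediate", 14, 164),
    ("Control", "2 days", 14, 126),
    ("Control", "7 days", 18, 107),
    ("List I", "immediate", 14, 158),
    ("List I", "2 days", 17, 193),
    ("List I", "7 days", 21, 200),
    ("List II", "immediate", 15, 172),
    ("List II", "2 days", 17, 170),
    ("List II", "7 days", 18, 158),
]


def average_per_subject(n_subjects, n_drawings):
    """Average number of figures reproduced per subject."""
    return n_drawings / n_subjects


def averages_by_period():
    """Return {period: {group: average rounded to one decimal}}."""
    table = {}
    for group, period, n_subjects, n_drawings in TABLE_I_COUNTS:
        avg = round(average_per_subject(n_subjects, n_drawings), 1)
        table.setdefault(period, {})[group] = avg
    return table


if __name__ == "__main__":
    for period, groups in averages_by_period().items():
        print(period, groups)
```

At immediate recall the Control Group average (11.7) is slightly above the two suggestion groups (11.3 and 11.5), which fits the inhibiting effect mentioned; after seven days the order is reversed (5.9 against 9.5 and 8.8).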

2. More Objective Evidence of the Influence of Verbal Suggestion
in the Recall Period


The evidence presented in the previous section is based 
upon the judgment of the experimenters. However, it is 
possible to secure evidence of a more objective nature. For 
some of the figures there are characteristic changes which 
can be counted, measured, etc. In securing this evidence, 
the learning associations, reported by the Os after recall, are 
disregarded. Thus any inaccuracy on the part of the subject
is ruled out. Furthermore, we see the effect of suggestion in 
recall despite the learning association. It would be interest- 
ing to make a similar analysis taking into account the learning 
associations but it is felt that the number of subjects in each 
group is too small here to make such an analysis. The effect 
of suggestion is seen in comparing the changes in the Sugges- 
tion Groups with those of the Control Group (non-suggestion). 
If the reader will refer to Chart I in connection with the analy- 
sis below, the measurements referred to will be more clearly 
comprehended. 

Graph II shows the effect of suggestion on the reproduction 
of the first figure. The suggestion of ‘diamond in a rectangle’ 
tends to bring the inside lines together at the bottom of the 
figure, whereas the suggestion ‘curtains in a window’ tends to 
keep them separated in reproduction. This effect is quite 
evident after seven days. 

Table III shows the effect of the learning association in the
Control Group on these same lines for the same figure. Most
of those who had associated the figure with ‘curtains’ leave
the lines open at the bottom. Two-thirds of those who had
associated the figure with ‘diamond in a rectangle’ close the
‘diamond.’ For other learning associations aside from these
two, half draw it each way. All of those reporting no association
leave the lines open as in the stimulus figure.
It seems that this in itself is an indication of the role of verbalization.
Where verbalization is absent object assimilation
is not as likely to operate. This is a problem which deserves
a special experimental investigation. We shall not be able to
pursue the problem any further here.

TABLE III
NUMBER OF FIGURES OPEN-CLOSED AT THE BOTTOM IN CONNECTION WITH THE LEARNING
ASSOCIATION OF THE CONTROL GROUP FOR THE ‘CURTAINS-DIAMOND’ FIGURE

For all Recall     Curtains in      Diamond in       Other            No
Periods            Window           Rectangle        Association      Association
                   Open   Closed    Open   Closed    Open   Closed    Open   Closed
Percent            80     20        33     67        50     50        100    0
N                  15               9                4                12
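The percentages of Table III can be turned back into approximate numbers of drawings. The sketch below assumes that the unlabeled second row of the table (15, 9, 4, 12) gives the number of figures under each learning association; the names used are hypothetical.

```python
# association: (assumed number of figures, percent open, percent closed)
TABLE_III = {
    "curtains in window": (15, 80, 20),
    "diamond in rectangle": (9, 33, 67),
    "other association": (4, 50, 50),
    "no association": (12, 100, 0),
}


def open_closed_counts(n_figures, pct_open, pct_closed):
    """Convert rounded percentages back to whole numbers of drawings."""
    n_open = round(n_figures * pct_open / 100)
    n_closed = round(n_figures * pct_closed / 100)
    return n_open, n_closed


if __name__ == "__main__":
    for association, (n, pct_open, pct_closed) in TABLE_III.items():
        print(association, open_closed_counts(n, pct_open, pct_closed))
```

Under that assumption the counts are whole numbers that sum to the row totals (for example, 12 open and 3 closed among the 15 ‘curtains’ cases), which suggests how few cases lie behind each percentage.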

Graph III shows the effect of suggestion in recall on the 
relative depth of curvature in the reproductions of the ‘moon- 
letter C’ figure. It will be noted that there is a general trend 
toward decrease in curvature. This is due, in part at least, 
to the fact that nearly all of the subjects associated this 
figure with some sort of new moon in the learning period. 
The suggestion of ‘letter C’ in recall tends to hold the depth 
of curvature against the general tendency to make a more 
shallow construction. 

Graph IV shows another important change in this same 
figure. Since so many of the Os thought of this figure as a 
new moon in the learning period, there was a tendency to 
reverse the direction of the ‘horns’ or to turn them up. The
suggestion of ‘letter C’ in recall tends to maintain correct 
direction of ‘horns’ as well as depth of curvature. 

In Graph V is shown the percent of reproductions showing 
a rounding of the top of the figure in the ‘bee hive—pilgrim’s 
hat’ figure. The suggestion of ‘bee hive’ in recall tends to 
increase the number showing this effect. The suggestion of 
‘pilgrim’s hat’ slightly decreases this tendency. Again the
effect is especially evident after seven days. 

In the ‘jug-stirrup’ figure it is possible to count the number 
of figures in each group which show a definite break or turning 
point in the contour of the top of the figure ‘for strap attach- 
ment.’ Drawings showing a flange at the top, as in the first 
example of this figure in Chart I, are not counted since they 
are definitely of the ‘jug’ type. The percentages showing this
break are as follows:

Suggestion ‘Jug’ Control Suggestion ‘Stirrup’

This break in contour is not enough in itself to classify the
reproduction as showing the effect of the suggestion ‘stirrup,’ 
but it is significant that 45 percent of the cases show this 
characteristic under this condition. 

In the ‘eye-glass—dumbbell’ figure we express the length
of the connecting line as a proportion of the length of the
whole figure. There is little difference shown between the
three groups for immediate and 2-day recall, but after seven 
days the effect is quite evident. The suggestion is effective 
in both Suggestion Groups: with the suggestion ‘eye-glass’ 
the relative length of the connecting line is decreased; with the 
suggestion ‘dumbbell’ it is lengthened. The results for the 
seven day recall period expressed as average percentages are 
as follows: 


Suggestion ‘Eye-glasses’    Control    Suggestion ‘Dumbbell’
         28%                  34%               47%


An effect of suggestion on the ‘hour glass-folding camp
stool’ figure is to increase the variability of the ratio of width to height.
The suggestion ‘hour glass’ increases the height in relation 
to the width: the suggestion ‘folding camp stool’ has the op- 
posite effect. Width of the figure was taken arbitrarily as the 
width of the base of the top triangle of the figure. The re- 
sults for all three recall periods combined expressed as the 
average percent of the width to height are as follows: 

Suggestion ‘Hour Glass’    Control    ‘Folding Camp Stool’
         67%                 71%              897%

There is a slight tendency for increased differentiation with
delayed recall, but this effect appears to operate from imme- 
diate recall. It would appear that this constancy is due to 
the fact that most learning associations are Roman numeral 
ten. The suggestion ‘folding camp stool’ has more effect 
presumably because ‘hour glass’ is more like Roman numeral 
ten.

In the ‘gun-horn’ figure there are several characteristic 
changes. The suggestion ‘horn’ tends to widen the end into 
a bell if that type of horn is thought of. The two lines on the 
stimulus figure serve as the breaking point for this bell. If 
another type of horn is thought of, there are other changes such
as are illustrated in Chart I. The suggestion ‘gun’ is likely
to introduce the characteristic curve for the gunstock or to 
modify the two little lines for a trigger. Instances of this sort 
increase with delayed recall. At the seven day recall period 
one or both of these ‘gun’ modifications occur as follows: 


Suggestion ‘Gun’    Control    Suggestion ‘Horn’
      71%             30%              7%

The ‘seven-four’ and the ‘two-eight’ figures are very satisfactory
for judging the influence of suggestion in recall although
it is difficult to find a single characteristic for measuring.
Different types of change are shown in Chart I. The
‘frankfurter-canoe’ and the ‘pine tree-spearhead’ figures are
difficult to handle in any way. In these figures we tried to
improve on the verbal stimuli used by Carmichael et al. but 
without producing any marked characteristic differentiation. 


IV. DISCUSSION OF RESULTS AND THEORETICAL
INTERPRETATION


It is unfortunate that the figures used in this experiment 
were not chosen with an objective measure in mind such as 
those presented in the second section of the results. The 
purpose in the beginning of the experiment was merely to use 
the method of Carmichael et al. Fortunately some of these 
figures were such that objective measures could be made. 
These objective measures verify the judgments. Further- 
more, they supplement them in that they give a more com- 
plete description of the changes which occur. 

It might be asked: How do you know that the figures
reproduced were based upon a trace rather than being mere
drawings of the figure named? In the first place, the Os were
instructed not to draw a figure unless they had some memory 
of it from the learning period. There was the added precau- 
tion of having the subjects mark those figures which were 
drawn without memory of the perceived figures. A third 
precaution was taken. A group of Os who had not seen the 
figures were asked to draw the figures from the suggestions. 
An examination of these drawings shows many variations 
which never appear in the experimental groups. A few 
figures were discarded on the basis of these drawings. Since 
the rapport with the subjects was excellent, the authors feel 
that cooperation was secured on this point. The fact that 
over 90 percent of the subjects reported learning associations
in recall is the best indication that they actually had some 
memory of the stimulus figure. 

It is not surprising that verbalization in recall should play 
a role in determining the reproduction of visually perceived
forms. It has long been known that verbalizations change 
with the passing of time. Stern has emphasized this point in 
his studies of testimony. In Shultz’s summary of Stern’s 
work [see Whipple (8), p. 39] he says that in repeating a report 
the repetitions “are based upon the memory of the verbal
statements of the earlier reports rather than upon the original
experience itself, i.e., the latter report undergoes distortion on
account of the flexibility of the verbal expression.” Gary 
Myers (7, p. 69) reports in connection with memory for 
words: “The number of words substituted increases with time, 
and many of the substituted words seem to be the embodiment 
of the thought that is carried in memory rather than the 
word.”

Certainly the trace which is the basis of a figure con- 
structed in recall is a more complicated structure than the 
simple ‘neural engram’ of a photographic-like perception of a 
figure or form. On the basis of the experimental results, a 
reaction theory of reproduction seems to be forced upon us. 
O has a task to perform—the task of constructing a figure 
which has been reacted to in a certain situation in the past. 
He may remember that the figure looked like some object, 
or remember some verbal analysis made during the study of 
the figure. He may remember, too, some of the exceptions 
which kept the figure from being an exact representation of 
the associated object. There are details which were not 
reacted to specifically in perception; hence these have to be
produced in construction. The task of construction is likely 
to be accompanied by some verbalization more or less like 
that which occurred in the learning period. His construction 
is based upon these learning reactions, these reactions in recall, 
and a general knowledge and experience with figures. 

Verbal suggestion in recall in the present experiment ap- 
pears to operate in several ways as an aid in this construction. 
In many cases the suggestion enables O to contact the trace of 
the figure and the learning verbalization. If he remembers 
the latter vividly he may disregard the suggestion in recall. 
On the other hand, if the learning association is indefinite, 
he may depend to a great extent upon the suggestion to fill
in the unclear characteristics of the trace. Often the in- 
fluence of both the learning association and the suggestion in 
recall is evident. With increased delay in construction, the 
suggestion in recall operates more and more as a factor in the 
production of the figure. 

If there is no suggestion given in recall, as with the Control 
Group in this experiment, more reliance is placed upon the 
experiences accompanying the perception of the figure. With 
increased delay, O seems to strive more earnestly for a re- 
construction of these experiences. As pointed out in Table I, 
there is an increase in the percent of figures showing the 
effect of the learning association, and a decrease in the 
percent showing changes which cannot be explained by such 
associations. It is in this activity of striving for reconstruc- 
tion that verbalization in recall is likely to operate. There is 
forgetting of the experiences accompanying the learning of 
the figure just as there is forgetting of the detailed character- 
istics of the figure. Hence we have the use in construction of 
ideas only approximating the original experience, and con- 
sequently, a transformation of the produced figure. 

Since there is clear-cut evidence to demonstrate the in- 
fluence of the mode of apprehension upon the reproduction, 
the increasing unclarity of the trace with delayed construction, 
and the constructive nature of the reproduction in the recall 
period, it is not necessary to assume with Wulf (11) and 
Koffka (5, pp. 493 ff.) that an hypothetical ‘neural engram’ 
gradually transforms toward a better (a more pregnant) trace 
during retention, and that the reproduced figure is a repre- 
sentation of this better trace. If the reproduction can be 
partly transformed by verbal suggestion in the learning 
period and by the same means in the recall period, it is very 
likely that any relevant experience occurring between these 
two periods or previous to them can be effective during the 
construction in recall. We would also expect this to be true 
in recognition to a less marked degree. 


(Manuscript received January 17, 1939) 
PROBABILITY AS A DETERMINER OF
RAT BEHAVIOR 


BY EGON BRUNSWIK 
University of California 


INTRODUCTION 


In the natural environment of a living being, cues, means 
or pathways to a goal are usually neither absolutely reliable 
nor absolutely wrong. In most cases there is, objectively 
speaking, no perfect certainty that this or that will, or will 
not, lead to a certain end, but only a higher or lesser degree of 
probability (cf. Tolman and Brunswik, 9). The psychology 
of learning reflects only occasionally this trait of the environment
(e.g. Thorndike, 6, 7).¹ Usually the connection between
means and end is made by the experimenter to be what Hume 
or John Stuart Mill would call indissoluble or inseparable, one 
of the alternative behaviors being always rewarded and the 
other never.

The need to study ambiguity experimentally became espe- 
cially urgent in the psychology of perception. The natural 
cues used by the perceptual system are, without exception, 
ambiguous in character. This, in its turn, leads to ambiguity 
of the response as well. As indicated in recent studies (1, 2), 
the perceptual system under certain conditions will learn to 
make use of new kinds of cues even if they do not prove to be 
highly reliable. 

The present report deals with an attempt to study the 
learning response of organisms to ambiguous means to a certain
end, within the field of overt action, in a way comparable
to the studies in perception. Rats were used as subjects.
A single choice situation was given. In a series of experimental
conditions the relative frequency, or the degree of
‘probability’, of reward on the two sides was varied. The
question was to what extent the animals would be able to
learn to prefer the more frequently rewarded side to the less
profitable one, or, in short, to distinguish a better chance of
reward from a less good one. In the ‘main experiments,’ five
different combinations of probability were used. In additional
‘danger experiments’ two of these combinations were
repeated with punishment added in each instance of an unsuccessful
choice. In both of these experiments the original
training was reversed after a number of trials in order to study
relearning of probabilities. A further group of special experiments
was performed in order to throw light upon some particular
problems to be discussed later.

1 The usual case of an occurrence of ‘exceptions’ in training experiments is the
insertion of test trials within a series of conditionings. Experiments of the type
recently published by Skinner (5), in which only one out of a large number of trials
was rewarded, differ from ours in that the sequence of rewarded versus non-rewarded
trials was not a chance order and the animals were not faced with a discrimination
problem between two different kinds of probability patterns.


The Experimental Procedure.—The set-up consisted of a one-unit elevated T-maze. 
The paths on both sides were equal in length, namely 115 cm. The approach, from 
the starting point to the choice point, was 60 cm long. The breadth of the path was 
4-5 cm, its height above the floor 80 cm. Before entering the food compartment, the
animal had to pass a one-way door.

Wet food was used as a reward, and the same amount was given each day. In 
those cases in which no reward was given on one of the two sides (see below), the door 
to the food compartment at the end of the respective path was closed and the animal 
had to turn and go to the other door. In each trial the animal finally obtained food. 
The food was evenly distributed over the various runs in such a quantity that no extra 
feeding was necessary. The weight of the animals was controlled over the period of 
the experiment, an attempt being made to keep it approximately constant. The 
training was preceded by a hunger period which was kept constant for all animals. No 
preliminary training was given. 

In all of the main experiments the ‘original’ training consisted of 24 runs for each 
animal. Four trials were given on the first day, four on the second, eight on the third, 
and eight on the fourth. After the original training had been completed, ‘reversed’ 
training consisting of 16 more runs was given on the two following days with eight runs 
on each day. The length of the training periods was chosen after preliminary experi- 
ments had shown their suitableness. 

In the main experiments five different training groups, or types, with 48 animals 
in each group were used. The first of these groups was trained in the usual, unam- 
biguous all-or-none fashion, one side being always rewarded, the other side never. 
The objective chances for the animal to find food on side A as against the chances to 
find food on side B were, therefore, 100 percent to zero. Objectively speaking, the 
effect on either side was predictable with certainty. We may call this group the 
‘100 : 0’ group. 

A second group found food on side A only in 3/4 of the cases, whereas side B was 
rewarded in the remaining 1/4 of the cases. These latter cases might be called the 
‘exceptional’ cases, as contrasted to the regular ones. The objective chances of suc- 
cess were 75 percent to 25 percent on the respective sides. The second training group, 
therefore, shall be labeled ‘75 : 25.’ 

The exceptional cases were randomly distributed among the regular ones in an 
artificial chance order (see below). Let us call A the ‘generally more profitable,’ or, 
in short, the ‘profitable’ side. B might then be called the ‘generally less profitable’ 
or the ‘unprofitable’ side. Let us call each choice of the more profitable side a ‘correct 
response’ and each choice of the less profitable side an ‘error.’ The terms ‘error’ and 
‘unsuccessful choice’ are therefore not used synonymously in this study due to the 
particular conditions of our experiment. In all of the exceptional cases the suc- 
cessful choice is an error; and the unsuccessful choice is a correct response since the 
generally better chance was preferred to the less good one. 
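
The distinction between a ‘correct response’ and a ‘successful choice’ can be made concrete with a small sketch. The names `Trial`, `is_correct` and `is_successful` are illustrative assumptions and do not come from the study; the sketch applies to the conditions in which exactly one side holds food on each run.

```python
from dataclasses import dataclass

PROFITABLE_SIDE = "A"  # the generally more profitable side


@dataclass
class Trial:
    chosen_side: str    # side the animal entered first, "A" or "B"
    rewarded_side: str  # side holding food on this run


def is_correct(trial: Trial) -> bool:
    """Correct response: the generally more profitable side was chosen."""
    return trial.chosen_side == PROFITABLE_SIDE


def is_successful(trial: Trial) -> bool:
    """Successful choice: food was found on the side chosen first."""
    return trial.chosen_side == trial.rewarded_side


# An exceptional case: food is on B, and the animal chooses B.
exceptional = Trial(chosen_side="B", rewarded_side="B")
print(is_correct(exceptional), is_successful(exceptional))
```

In an exceptional case the two classifications diverge: the choice of B counts as an error although it is successful.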

A third group was treated in a way analogous to the second, with the only excep- 
tion that the probability of reward on the two sides was 2/3 against 1/3. We call this 
group ‘67 : 33.’ 

The three above mentioned groups have in common the feature that for every 
run there is food on one and only one side. The two sides are complementary to each 
other and the ultimate end of each run is fully predetermined. 

A further group found food on side A in all of the cases and, in addition to this, 
on side B in 1/2 of the cases. We call this group ‘100 : 50.’ It differs from the remaining 
four groups insofar as the ultimate end of each run is not determined beforehand, 
since in half of the cases a choice of either side leads to food. 

The last training group might be called ‘50: 0,’ food being given only on one of 
the sides in half of the cases. In the remaining half of the cases the animal found both 
doors closed. In such cases it had to return to the choice point once more, was picked 
up there by the experimenter and rewarded in a food box placed in a neutral position. 

In Table 1 a survey of the five training conditions is given. The order of presentation 
is rearranged in such a fashion that a decreasing series is formed with regard to the 
difference and to the ratio of the probabilities of reward on the two sides, whereby for 
each step there is either a decrease in the former or in the latter, the remaining feature 
being held constant. (See also the graphical representation in Fig. 3.) Row 4 refers 
to the trait of absolute objective certainty vs. uncertainty which will be considered later. 


TABLE 1 

SURVEY OF TRAINING CONDITIONS 

Row 
(1) Reward on profitable vs. unprofitable 
    side in terms of percent of trials, 
    indicating proportion of resp. ...       | 100 : 0  | 50 : 0               | 75 : 25 | 100 : 50   | 67 : 33 
(2) Difference of probabilities              | 100      | 50                   | 50      | 50         | 33.3 
(3) Ratio of probabilities                   | Infinite | Infinite             | 3 : 1   | 2 : 1      | 2 : 1 
(4) On which side is training unambiguous?   | Both     | Unprofitable         | None    | Profitable | None 
(5) Is final point of each run 
    predetermined?                           | Yes      | Yes (in half of ...) | Yes     | No         | Yes 
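
The two derived rows of Table 1 (difference and ratio of the reward probabilities) follow directly from the training conditions. A minimal sketch, with illustrative function names that are not part of the original study:

```python
from fractions import Fraction
from math import inf

# (probability of reward on the profitable side, on the unprofitable side)
CONDITIONS = {
    "100:0": (Fraction(1), Fraction(0)),
    "50:0": (Fraction(1, 2), Fraction(0)),
    "75:25": (Fraction(3, 4), Fraction(1, 4)),
    "100:50": (Fraction(1), Fraction(1, 2)),
    "67:33": (Fraction(2, 3), Fraction(1, 3)),
}


def probability_difference(p_a, p_b):
    """Difference of the reward probabilities, in percent."""
    return float((p_a - p_b) * 100)


def probability_ratio(p_a, p_b):
    """Ratio of the reward probabilities; infinite if side B is never rewarded."""
    return inf if p_b == 0 else float(p_a / p_b)


for name, (p_a, p_b) in CONDITIONS.items():
    print(name, round(probability_difference(p_a, p_b), 1), probability_ratio(p_a, p_b))
```

Read in the order given, each step lowers either the difference or the ratio while the other stays the same, which is the ordering principle stated for the table.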

In the reversed training period for each group the profitable side now was made 
unprofitable and vice versa. The proportion of the respective probabilities, however, 
was kept the same for each group as in the original training. 

In order to study the effect of the kind of ambiguity produced by a repeated reversal 
of training, group 100 : 0 received additional training for six consecutive days after 
the main experiments had been concluded. On each day eight trials of the 100: 0 
type were given but the direction of the training was reversed each day as compared 
to the day before. On the thirteenth and last day of the total training period a single 
additional trial was given in order to get data corresponding to the initial trials of the 
preceding phases. 
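
The full schedule for group 100 : 0 can be tallied as a check on the day count. This is only a bookkeeping sketch; the labels ‘A’ and ‘B’ and the function name are assumptions made for illustration, and the direction of the single trial on the last day is left open because the text does not state it.

```python
# Trials per day for group 100 : 0, as described in the text.
ORIGINAL = [4, 4, 8, 8]        # days 1-4, side A profitable
REVERSED = [8, 8]              # days 5-6, side B profitable
REPEATED_REVERSAL = [8] * 6    # days 7-12, direction reversed each day
FINAL_DAY = [1]                # day 13, single additional trial

trials_per_day = ORIGINAL + REVERSED + REPEATED_REVERSAL + FINAL_DAY


def profitable_side(day: int) -> str:
    """Profitable side on days 1-12; 'A' is the originally profitable side."""
    if 1 <= day <= 4:
        return "A"
    if day <= 6:
        return "B"
    if day <= 12:
        # Day 7 reverses day 6 (B), so odd days are A and even days are B.
        return "A" if day % 2 == 1 else "B"
    raise ValueError("direction on day 13 is not stated in the text")


print(len(trials_per_day), sum(trials_per_day))
print([profitable_side(day) for day in range(1, 13)])
```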

Each group of 48 animals was divided into 4 sub-groups of 12 animals each. In 
each sub-group for one half of the animals the right side was made more profitable, 
for the other half the left. The experiments were conducted in four temporally 
different periods. In each period one sub-group out of every training group was run 
in order to secure the greatest possible technical homogeneity between the training 
groups. For the same purpose, the animals in the various groups were approximately 
matched with regard to age, sex, and ancestry. A total of four animals died during the 
course of the experiment. They were replaced by other animals added to the last of 
the corresponding sub-groups. 

The sequence of the regular vs. the exceptional cases was varied from sub-group to 
sub-group in such a fashion that for each training group an equal distribution of the 
exceptional cases was obtained. As an example, Table 2 shows the distribution of the 


TABLE 2 


EXAMPLES OF THE ARTIFICIAL CHANCE SEQUENCES OF NON-REWARDED AND REWARDED 
CASES ON THE LESS PROFITABLE SIDE FOR THE FIRST EIGHT RUNS 
+ indicates food, — no food 


Training group 75 : 25 Training group 100: 50 
Run Sub-groups Sub-groups 
Number 

I II III IV I II III IV 
+ + 4 - 
2 -}] + +4] - 
3 +] - + 
4 - | +4 - 
5 - | + 4] - 4 + 
6 - +] - 4] + 
7 | +] +4 - 
8 +] +] - 4] - + 


rewarded cases among the non-rewarded on the unprofitable side for the four sub- 
groups of training group 75 : 25 during the first eight trials. In the following trials 
the sequences shown were interchanged among each other. Table 2 shows, further- 
more, the analogous scheme for training type 100 : 50. 
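
One way to produce sequences with an equal distribution of exceptional cases over the sub-groups is sketched below. This is a hypothetical generator, not the procedure actually used in the study. It assumes that ‘equal distribution’ means every run position is exceptional for the same number of sub-groups, and the function name and parameters are invented for illustration.

```python
import random


def balanced_schedule(n_runs, n_subgroups, n_exceptional, seed=0):
    """Return one list of booleans per sub-group; True marks a run on which
    the less profitable side is rewarded. Each sub-group gets n_exceptional
    such runs, and every run is used by equally many sub-groups."""
    total = n_subgroups * n_exceptional
    if total % n_runs != 0 or n_exceptional > n_runs:
        raise ValueError("cannot balance these numbers evenly")
    order = list(range(n_runs))
    random.Random(seed).shuffle(order)  # artificial chance order of the runs
    schedule = [[False] * n_runs for _ in range(n_subgroups)]
    for slot in range(total):
        subgroup = slot // n_exceptional
        run = order[slot % n_runs]
        schedule[subgroup][run] = True
    return schedule


# 75 : 25 over eight runs: two rewarded runs on the less profitable side.
for row in balanced_schedule(n_runs=8, n_subgroups=4, n_exceptional=2):
    print("".join("+" if rewarded else "-" for rewarded in row))
```

For 100 : 50 the same call with `n_exceptional=4` gives four rewarded runs per sub-group and two sub-groups rewarded on every run.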

In addition to the main experiments two further experiments of the types 75 : 25 
and 100 : 50 were conducted. They differed from the corresponding main experiments 
insofar as every unsuccessful attempt to reach food was punished by an electric shock. 
As suggested above we may call these experiments ‘danger experiments,’ since not only 
varying probabilities of success, but also of detriment were involved. A further dif- 
ference between main experiments and danger experiments was the omission of the 
fourth day of the original training. This was done in order to maintain an approxi- 
mately equal level for the results of the original training of the main experiments and 
of the danger experiments yielding an even start for relearning. The number of animals 
was, as in the main experiments, 48 in each group, subdivided into subgroups of 12. 
Thus the total number of animals used in the study was brought up to 336. 
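
The animal counts stated above can be checked with a few lines of arithmetic. The variable names are illustrative only.

```python
ANIMALS_PER_GROUP = 48
SUBGROUPS_PER_GROUP = 4
MAIN_GROUPS = ["100:0", "50:0", "75:25", "100:50", "67:33"]
DANGER_GROUPS = ["75:25", "100:50"]

animals_per_subgroup = ANIMALS_PER_GROUP // SUBGROUPS_PER_GROUP
# In each sub-group one half had the right side, the other half the left
# side, as the more profitable one.
animals_per_side = animals_per_subgroup // 2
total_animals = ANIMALS_PER_GROUP * (len(MAIN_GROUPS) + len(DANGER_GROUPS))

print(animals_per_subgroup, animals_per_side, total_animals)
```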

In the danger experiments, an electric grid of 30 centimeters length was placed in 
front of each of the doors and charged with 35 to 50 volts in every case in which the 
door behind it was closed. As in the main experiments, an animal which once had 
chosen his side had to proceed entirely up to the door and then attempt to open it. 
In those infrequent cases in which the animal attempted to turn after the first touch of 
the shock, he was forced back by the experimenter until he had tried to open the door. 

For all of the conditions, the experiments with three of the sub-groups were per- 
formed in the spring of 1936, whereas the fourth sub-group was added in 1937. The 
study was conducted at the University of California in Berkeley. The author is 
indebted to a seminar-group under the direction of Prof. Edward C. Tolman for valuable 
suggestions and criticisms. 


A number of preliminary experiments in which a somewhat different scheme of 
training and experimental technique was used, and in which the training for each 
animal was extended over a period of several months, will be reported in a separate 
publication. 


RESULTS AND DISCUSSION 


Figure 1 shows the results of the main experiments for 
the five training groups. Let us consider first the original 
training. As a test for the rat’s response we choose the results 
of the last eight trials. The quantitative data are given in 
Table 3. The averages and the sigma scores are computed 
from the number of ‘errors,’ that is, those cases in which each 
rat chose the generally less profitable side, during the last 
day of the original training. Row 3 shows the averages re- 
computed as percentage scores. In the remaining three rows 
of Table 3 a series of critical ratios to be discussed later is 
given. 

(1) Threshold of Probability and of Certainty.—Learning, 
that is discrimination of probabilities, takes place definitely 
for all of the groups except training type 67 : 33. According 
to Row 3 the deviations from chance level, that is from 50 
percent, are as high as 22.4 even in the case of training type 
100 : 50 which yields the smallest amount of learning after 
type 67 : 33. The CR’s move within the range from 4.0 to 
TABLE 3 
MAIN EXPERIMENTS. RESULTS OF ORIGINAL TRAINING 


The first row shows the average number of times an animal chose the unprofitable 
side during the eight trials of the last day of the original training. In the second row 
the sigmas of these averages are given. In Row 3 the data given in Row 1 are expressed, 
in correspondence to Fig. 1, in terms of average percentages. The chance 
level is at 50 percent. In the remaining rows a list of some of the more important 
critical ratios is given. 


Row | Training Groups in Terms of Probability of Reward | 100 : 0 | 50 : 0 | 75 : 25 | 100 : 50 | 67 : 33 
(1) | Average runs to unprofitable side in last ...     | 
(3) | Percentage error scores computed from ...         | 7.0     | 15.6   | 20.2    | 27.6     | 46.9 
(5) | CR with training group 67 : 33                    | 6.5     | 4.5    | 3.6     | 2.4      | - 
(6) | CR with training group 100 : 0                    | -       | 1.6    | 2.5     | 3.6      | 6.5 


TABLE 4 
DANGER EXPERIMENTS, ORIGINAL TRAINING 


Results given in analogy to Table 3. Row 5 shows the corresponding scores of 
the main experiment. Note that the comparison had to be made with the results not 
of the fourth but of the third day of the main experiment. Row 6 indicates the significance 
of the differences between the scores given in Row 3 and in Row 5. 


Row | Training groups in terms of probability of reward | 75 : 25 | 100 : 50 
    | Corresponding probability of punishment           | 25 : 75 | 0 : 50 
(1) | Average runs to unprofitable side in ...          | 
(3) | Percentage error scores computed ...              | 
(5) | Corresponding percentage score of main            | 
    | experiment (third day)                            | 26.0    | 26.5 (100 : 50), 22.6 (50 : 0) 
(6) | CR of difference between danger score ...         | 

16.7 (Row 4). For training group 67 : 33, on the other hand, 
the deviation from chance is only 3.1 percent. As shown 
by the low CR of .5, this difference is negligible. 
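
The deviations from chance quoted in this section can be recomputed from the percentage error scores of Table 3. A minimal sketch; the helper names are illustrative, and the assignment of scores to groups follows the values cited in the text.

```python
CHANCE_LEVEL = 50.0      # percent errors expected without any learning
TRIALS_ON_LAST_DAY = 8

# Percentage error scores, Row 3 of Table 3.
ERROR_PERCENT = {"100:0": 7.0, "50:0": 15.6, "75:25": 20.2,
                 "100:50": 27.6, "67:33": 46.9}


def percent_errors(mean_errors, n_trials=TRIALS_ON_LAST_DAY):
    """Convert an average number of errors per animal into a percentage."""
    return 100.0 * mean_errors / n_trials


def deviation_from_chance(error_percent):
    """Distance of a percentage error score below the 50 percent level."""
    return round(CHANCE_LEVEL - error_percent, 1)


for group, score in ERROR_PERCENT.items():
    print(group, deviation_from_chance(score))
```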

Let us compare training type 67 : 33 directly with the four 
remaining conditions. As is shown in Row 5, the first three 
differences are unquestionably above chance. Since, furthermore, 
according to Fisher (cf. 3, pp. 62 ff, 548), with a number 
of animals like ours CR’s of 2.0 indicate ‘significant’ differ- 
ences and CR’s of 2.7 ‘very significant’ ones, even the dif- 
ference between conditions 100 : 50 and 67 : 33 with a CR of 
2.4 can be called significant. As far as the training given is 
concerned, among all the conditions 75 : 25 seems to be par- 
ticularly similar to 67:33. Considering the shortness of 
the training the number of ‘exceptional’ cases is 6 and 8 
respectively, yielding a difference of only 2. Yet, as is shown 
by the CR of 3.6, in connection with the data concerning the 
results of training group 67 : 33 mentioned above, condition 
75:25 is definitely above, whereas 67 : 33 is below what 
might be called the ‘threshold of probability’ under the 
general conditions of our experiment.² 
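
The reading of the critical ratios against the two limits cited from Fisher, and the count of exceptional cases in 24 runs, can be sketched as follows. The function names are assumptions made for illustration; the thresholds and CR values are those given in the text and in Table 3.

```python
from fractions import Fraction

SIGNIFICANT = 2.0        # lower limit for 'significant', as cited from Fisher
VERY_SIGNIFICANT = 2.7   # lower limit for 'very significant'
ORIGINAL_RUNS = 24


def significance(cr):
    """Label a critical ratio according to the two limits used in the text."""
    if cr >= VERY_SIGNIFICANT:
        return "very significant"
    if cr >= SIGNIFICANT:
        return "significant"
    return "not significant"


def exceptional_cases(p_unprofitable, n_runs=ORIGINAL_RUNS):
    """Number of runs on which the less profitable side is rewarded."""
    return int(n_runs * p_unprofitable)


# CR of each group against training group 67 : 33 (Row 5 of Table 3).
CR_WITH_67_33 = {"100:0": 6.5, "50:0": 4.5, "75:25": 3.6, "100:50": 2.4}

for group, cr in CR_WITH_67_33.items():
    print(group, cr, significance(cr))
print(exceptional_cases(Fraction(1, 4)), exceptional_cases(Fraction(1, 3)))
```

With 24 runs, condition 75 : 25 has 6 exceptional cases and condition 67 : 33 has 8, so the two schedules differ by only two runs.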

It was to be expected that the traditional unambiguous 
training type 100 : 0 would yield the best results. Indeed its 
error score of 7.0 percent is not far from 0. It is, as shown by 
the last row of Table 3, significantly different from the scores 
of all the other conditions except training group 50:0. The 
remaining conditions can, therefore, be called more or less 
definitely below the ‘threshold of certainty’ under the general 
conditions of our experiment. By saying this we mean that 
the response is clearly distinct from the response found in the 
case of an unambiguous training of the same length. 

(2) Influence of Probability Difference.—The rank order 
of scores follows exactly the order of training conditions as 
shown in Table 1 on the basis of a combined consideration of 
the order of the differences (Row 2) and of the ratios (Row 3) 
of the probabilities of success on the two sides of each of the 
probability discrimination problems. 

Discrimination seems, therefore, firstly, to increase with an 
increasing difference of probabilities even when their ratio is 
held constant. As was mentioned above, training groups 
100 : 50 and 67 : 33 react differently to a significant degree 
though they both have the same probability ratio of 2 : 1 and 
differ from each other only by having probability differences 
of 50 versus 33.3. Though below significance, the difference 
between training groups 100 : 0 and 50 : 0, with probability 
differences of 100 and 50 respectively, shows markedly the 
same trend in spite of the fact that for both the ratio of 
probabilities is infinity. 


2 Preliminary experiments of the present author on learning of ambiguous cues in 
perception (1) seem to indicate that under the conditions of these experiments, in 
contrast to the present findings, the perceptual system is able to learn cues even if their 
indicative value is as low as 67 : 33, yielding a tetrachoric correlation coefficient 
between the cue and the event indicated of not more than r = .33. 

(3) Tendency toward Weber Law of Probability.—Secondly, 
when the difference of probabilities is held constant, some 
indication is given that an influence is superimposed which is 
due to the varying ratio of probabilities, per se. We have in 
mind the three conditions 50:0, 75 : 25, and 100: 50. For 
all of them the probability difference is 50 whereas the 
probability ratio decreases from infinity to 3:1 and 2:1 
respectively. The error scores in Table 3 increase from 15.6 
to 20.2 and 27.6, that is, learning decreases slightly with 
decreasing ratio of probability. In our main experiment, 
however, this tendency is not quite significant. The CR of 
the difference between training groups 50:0 and 75: 25 is 
only .8 and between training groups 75 : 25 and 100: 50 only 
1.0. Between 50:0 and 100: 50, however, there is a CR of 
1.8 which comes near to the value of 2.0 given by Fisher as 
the lower limit of significance for a number of subjects equal 
to ours. Taking the gradual increase of the scores of the 
three conditions as a whole, there is still more reason to be- 
lieve that their sequence is not due to chance though no stand- 
ardized statistical means were used to express this increase 
in significance quantitatively. 
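
The ordering argument for the three conditions with equal probability difference can be stated as a simple check. This sketch only tests the direction of the trend and says nothing about its significance; the function name is an assumption made for illustration.

```python
from math import inf

# (condition, probability ratio, percentage error score from Table 3)
# for the three conditions whose probability difference is 50.
EQUAL_DIFFERENCE = [
    ("50:0", inf, 15.6),
    ("75:25", 3.0, 20.2),
    ("100:50", 2.0, 27.6),
]


def errors_rise_as_ratio_falls(rows):
    """True if the error score grows at every step down in probability ratio."""
    ordered = sorted(rows, key=lambda row: row[1], reverse=True)
    errors = [row[2] for row in ordered]
    return all(a < b for a, b in zip(errors, errors[1:]))


print(errors_rise_as_ratio_falls(EQUAL_DIFFERENCE))
```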

Further evidence for the influence of the probability ratio, 
per se, is given by the results of the ‘danger’ experiments, in 
which an electric shock in each case of an unsuccessful choice 
was applied. The data are given in Fig. 2 and Table 4 in 
a way similar to the main experiments. As follows from 
Rows 1 and 2 in Table 4, the CR of the difference between the 
two ‘danger’ conditions 75 : 25 and 100 : 50 is 2.9. This is 
more than the lower limit for ‘very significant’ (according 
to Fisher 2.7). It exceeds also by far the corresponding CR 
for the main experiments in spite of the fact that the main 
experiments had a longer training period. For further discussion 
cf. Sect. (5). 

It is to be noted, furthermore, that the direction of the 
influence of the probability ratio upon discriminative learning 
is reversed in the case of the ‘danger experiments.’ As was 
shown in the main experiments, learning is fairly likely to 
increase with the probability ratio, per se, of reward. How- 
ever, it decreases markedly when an emphasis is put on the 
non-rewarded side by the introduction of punishment. Learn- 
ing then increases with an increase of the probability ratio of 
punishment. Apparently the avoidance of punishment be- 
comes a stronger motive as compared with the anxiousness of 
the animal to reach food as quickly as possible. If we con- 
sider not food, but rather punishment the ‘stimulus’ most 
active in the danger experiments, danger experiment 100 : 50 
becomes representative of main experiment 50 : 0 rather than 
of main experiment 100 : 50; condition 75 : 25, however, does 
not change its formal character (cf. the top part of Table 4). 
If this aspect is considered, the reversal of results mentioned 
above disappears and the results of both the main experiments 
and the danger experiments can be expressed simultaneously 
in the following generalized way: Discrimination of probabili- 
ties tends to increase with the increase of the ratio of the 
probability of emphasis³ on the two sides of the probability 
discrimination problem. 
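
The re-expression of the danger conditions in terms of punishment can be sketched as follows. It rests on the stated rule that every unsuccessful choice was punished, so that the probability of punishment on a side is taken as the complement of the probability of reward there. The function names are illustrative, and the ‘emphasis ratio’ is simply the larger probability divided by the smaller.

```python
from math import inf


def punishment_pattern(reward_a, reward_b):
    """Percent of runs on which a choice of side A or side B meets shock."""
    return (100 - reward_a, 100 - reward_b)


def emphasis_ratio(x, y):
    """Ratio of the larger to the smaller probability; infinite if one is 0."""
    high, low = max(x, y), min(x, y)
    return inf if low == 0 else high / low


for reward in [(75, 25), (100, 50)]:
    punishment = punishment_pattern(*reward)
    print(reward, punishment, emphasis_ratio(*punishment))
```

In terms of punishment, condition 100 : 50 becomes 0 : 50 with an infinite ratio, while 75 : 25 becomes 25 : 75 and keeps its ratio of 3 : 1.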

This statement bears a certain resemblance to the Weber 
Law as formulated in the psychophysics of sensation. Since 
the present study is to be considered a contribution to what 
might be called the ‘psychophysics of probability’ the above 
result could be interpreted as a tendency toward a ‘Weber 
Law of probability.’ It should be kept in mind, however, 
that the analogy cannot be stretched so far as to assert an 
independence of the learning effect from factors other than 
the probability ratio. As was shown above, the probability 
difference, per se, has an influence too, and this influence 
appears to be even more marked than that of the probability 
ratio. The strength of the response appears to follow rather 
a kind of compromise of the two tendencies exerted by 
probability difference, on the one hand, and probability ratio, 
on the other. This is, however, not an obstacle in the way 
of comparing our findings to those related to the Weber Law 
of traditional psychophysics, since some results of the psychophysics 
of sensation could similarly be interpreted as deviations 
from the Weber principle in the manner of a compromise 
with other factors such as stimulus difference, though the role 
of the ratio of the stimuli usually is much more marked than 
in our case. 


3 It has to be noted that in the present connection the term ‘emphasis’ is used in a 
somewhat different sense than in the study by Tolman, Bretnall and Hall (10) where 
it was introduced to designate the fact that it is easier to learn to go to an emphasizer 
than away from it. 

A graphical representation of the conditions, and of 
the results discussed in the last two sections, can be found in 
Fig. 3. Note how the curve of the original learning shows a 
continuous downward trend though this trend is accompanied 
merely by a fall of either the curve representing the probability 
differences or the curve representing the probability ratios but 
never of both at the same time. Note furthermore the cor- 
respondence in the directions of the curves for the main 
experiment and for the danger experiment where the training 
conditions for the latter are expressed, as suggested above, in 
terms of punishment and not of reward. 

(4) ‘Frequency’ vs. ‘Frequency-Configuration.’—As implied 
in the statements of the foregoing paragraphs, our results 
can be understood only as reactions to the total pattern of 
frequencies of reward and non-reward on the two sides of the 
discrimination problem, of which ‘probability difference’ and 
‘probability ratio’ are but single aspects. An attempt to 
explain the results in terms of isolated frequencies could not 
be generally justified. Though it might seem possible, for 
example, to look at the 50:0 curve as possessing half the 
acceleration of the 100: 0 curve, for which the frequency of 
reward on the profitable side is twice as large as for condition 
50 : 0, yet in other cases a similar consideration would fail. 
For example, conditions 100 : 0 and 100 : 50 possess the same 
amount of reward on the profitable side and yet the two curves 
are significantly different. Or: curve 67 : 33 has at the end 
of the original training not reached the level which was attained 
by curve 75 : 25 after 21 trials (that is, at the middle 
of the last day of the original training), in spite of the fact 
that under these conditions the absolute frequencies of reward 
on the profitable side are equal for the two conditions. In 
short, the traditional notion of ‘frequency’ (or of ‘relative 
frequency’) as a factor influencing learning has to be amplified 
to that of a ‘frequency pattern,’ of which isolated frequencies 
are but special aspects applicable under the limited conditions 
of a non-ambiguous training. 

Fig. 3. Graphical survey of training conditions and of results. 
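
The comparison of absolute reward frequencies made in the paragraph above can be checked numerically. The sketch counts the expected number of runs with food on the profitable side as arranged by the experimenter; the function name is illustrative.

```python
from fractions import Fraction


def expected_rewards(n_trials, p_reward):
    """Expected number of runs with food on the profitable side."""
    return float(n_trials * p_reward)


# 67 : 33 at the end of the original training (24 trials)
rewards_67_33 = expected_rewards(24, Fraction(2, 3))
# 75 : 25 after 21 trials
rewards_75_25 = expected_rewards(21, Fraction(3, 4))

print(rewards_67_33, rewards_75_25)
```

The two figures, 16 and 15.75, are equal to within a fraction of one run, which is the sense in which the frequencies are called equal.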

It goes without saying, however, that not only frequency, 
but also recency as an isolated factor could not account for 
our results, since in that case all curves would have to assume a 
strictly horizontal trend, due to the rectangular distribution 
of the ‘exceptional’ cases as outlined in Table 2 and the corresponding 
text. No special attention was given in this study 
to the effect of recency, per se. 

(5) Is ‘Being without Exception’ an Emphasizer?—Objec- 
tive inseparability of good or bad means-end relationships is 
considered to be an important factor in human life. In such 
cases there is either a 100 percent or a 0 percent probability 
for a certain kind of happening. It is shown in Row 4 of 
Table 1, in what way our conditions differ with respect to the 
presence or absence of strict non-ambiguity on the two sides 
of our discrimination problem. The trait of certainty, or of 
‘being without exception,’ is more frequently associated with 
the conditions represented on the left side of the Table. 
These conditions, at the same time, are those for which learn- 
ing is better. One might, therefore, try to ascribe discrimina- 
tion not so much to the probability differences or probability 
ratios—represented in Row 2 and Row 3—but chiefly to non- 
ambiguity versus ambiguity of training on one or both sides 
as represented in Row 4. 

A more detailed inspection shows, however, that such an 
argument can be excluded. Firstly, conditions 75 : 25 and 
67 : 33 both entirely lack ‘certainty.’ In each of these cases 
there are ‘exceptions’ from the rule on both sides of the dis- 
crimination problem. Nevertheless, the results of the two 
respective training groups are significantly different, as was 
outlined above. Secondly, considering security as such, 
condition 75 : 25 is worse off than its two neighbors, 50 : 0 and 
100: 50. There is, however, as Row 3 of Table 3 shows, not 
the slightest indication that any effect is superimposed on that 
of the variation of the probability ratio among these three 
conditions. In Fig. 3 this is indicated by the straight course 
of the middle part of the original learning curve connecting 
50:0 with 100: 50. We have to conclude, therefore, that 
for our material it is chiefly the high probability difference 
and the high probability ratio, and not lack of ambiguity on 
the particular sides of the discrimination problem, per se, 
which yields advantage to the discrimination problems on 
the left side of Table 1. 

Concretely speaking, our results tend to indicate that, for 
instance, a probability discrimination 95 : 5 (having only one 
‘exceptional’ case out of 20) would be almost as easy to learn 
for the rat as is 100:0. It seems to be a particular feature 
of the discursified aspects of human behavior to overemphasize 
the ‘always’ and the ‘never,’ that is, for example, to draw a 
sharp boundary line between the exceptionless validity of 
logical or physical ‘laws,’ on the one hand, and ‘merely 
statistical’ regularities or high probabilities of a less unam- 
biguous and therefore much less principal character, on the 
other. This attitude, however, might be rather strange to an 
animal of a more primitive status of organization, like the rat, 
as it is likewise strange to the intuitive system of human 
perception or to the chief core of our human practical activities. 

The possibility might not be excluded that in the case of 
the danger experiments the advantage of condition 50 : 0 (in 
terms of punishment) as compared with 75 : 25 is due not so 
much to the decrease of the probability ratio, per se, but 
rather to an influence of complete security from punishment on 
one side of training pattern 50:0. The question could not 
be decided quantitatively unless experiments under other 
conditions (e.g. 95 : 5, or 100 : 50, both in terms of punishment) 
were available for comparison. It should be kept in 
mind, however, that there was no indication of an effect of 
certainty, per se, in the case of the corresponding main 
experiments. To a certain extent all discussions of this kind 
become futile due to the fact that complete security always 
goes with extreme probability ratios and thus the two alterna- 
tive interpretations become synonymous at least as far as the 
borderline types of training are concerned. 

(6) Relearning of Probabilities—Let us turn now to the 
results of the reversed training. As is shown in Fig. 1, by far 
the best result was obtained with training group 100:0. 
As pointed out above, the advantage of this condition as 
compared with some of the other training types was not very 
marked during the original training period. This holds espe- 
cially with regard to condition 50: 0, yielding with 100:0 a 
CR of not more than 1.6 for the last day of the original training 
(cf. Row 6 of Table 3). The last day of the reversed training 
shows a marked difference between these two conditions with 
a CR of 8.2 in spite of the fact that it had also to compensate 
for an opposite difference established in the original training. 
In the case of relearning, therefore, condition 50 : 0 seems to be 
definitely below the threshold of certainty (cf. above Sect. 1). 
Two probability patterns mastered with approximately equal 
success in learning might, therefore, be clearly differentiated in 
relearning. In addition to learning, relearning thus seems to 
be a sensitive instrument in testing probability discrimination. 
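
The text does not state how the critical ratios were computed. A common form, assumed here, divides the difference of two averages by the square root of the sum of their squared sigmas. Applied to the Table 5 averages for groups 100 : 0 and 50 : 0, and assuming that the first two sigma entries of that table belong to these two groups, the sketch gives a value close to the CR of 8.2 quoted above.

```python
from math import sqrt


def critical_ratio(mean_1, sigma_1, mean_2, sigma_2):
    """Difference of two averages over the sigma of that difference.
    Assumed formula; the original computation is not given in the text."""
    return abs(mean_1 - mean_2) / sqrt(sigma_1 ** 2 + sigma_2 ** 2)


# Table 5: average runs and sigma of average, last day of reversed training.
cr = critical_ratio(7.12, 0.18, 3.04, 0.47)
print(round(cr, 1))
```

The small remaining gap is within what rounding of the tabulated sigmas could produce.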

The differences between the percentage error scores of 
the last day of the original training and those of the last day 
of the reversed training, as well as their CR’s, are given in 
Table 6. There is, according to Fisher, a significant trend 


TABLE 5 


RESULTS OF MAIN EXPERIMENTS AT THE END OF REVERSED TRAINING 


Average number of runs per animal to formerly profitable, now unprofitable side, for the 
eight trials on the second and last day of the reversed training 


Training Type      | 100 : 0 | 50 : 0 | 75 : 25 | 100 : 50 | 67 : 33 
Average runs       | 7.12    | 3.04   | 2.52    | 3.67     | 4.50 
Sigma of average   | .18     | .47    | .51     | .52      | 
TABLE 6 


RELEARNING OF PROBABILITIES IN THE MAIN EXPERIMENTS 


The first row shows the amount of relearning (or unlearning), that is the difference 
of the average percentage scores of the last eight trials of the original training 
as given in Table 3 and of the last eight trials of the reversed training. The second 
row shows the critical ratio of this difference. 


                                        | 100 : 0 | 50 : 0 | 75 : 25 | 100 : 50 | 67 : 33 
Difference of percentage score of 
original vs. reversed training          | 82.0    | 22.4   | 11.3    | 18.3     | 
CR of this difference                   | 23.4    | 3.2    | 1.4     | 2.1      | 
in the direction of extinction or relearning in all instances 
except training types 75 : 25 and 67: 33. In the latter case, 
however, not even learning could be observed. The close 
resemblance between the difference scores and the CR’s for 
training types 75 : 25 and 67 : 33 indicates that in this case 
two conditions, though significantly differentiated in learning, 
do not differentiate in relearning. The additional confusion 
introduced by the reversal of the probability pattern might 
be effective in bringing condition 75:25 down below the 
threshold of probability. 
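
The difference scores of Table 6 can be reproduced from the averages of Table 5 and the percentage scores of Table 3. The sketch below converts each Table 5 average into a percentage of the eight trials and subtracts the original percentage score; that this is how the tabulated differences were obtained is an inference from the numbers agreeing, not something the text states. Only the four groups whose difference is legible in Table 6 are included.

```python
TRIALS = 8

# Row 3 of Table 3: percentage scores at the end of the original training.
ORIGINAL_PERCENT = {"100:0": 7.0, "50:0": 15.6, "75:25": 20.2, "100:50": 27.6}
# Table 5: average runs per animal on the last day of the reversed training.
REVERSED_MEAN_RUNS = {"100:0": 7.12, "50:0": 3.04, "75:25": 2.52, "100:50": 3.67}


def relearning(group):
    """Shift in percentage score between original and reversed training."""
    reversed_percent = 100.0 * REVERSED_MEAN_RUNS[group] / TRIALS
    return round(reversed_percent - ORIGINAL_PERCENT[group], 1)


for group in ORIGINAL_PERCENT:
    print(group, relearning(group))
```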

It should be noted, furthermore, that the two conditions 
75:25 and 67 : 33 which are the worst off in relearning at the 
same time are characterized by ‘ambiguity’ of training on 
both sides of the discrimination problem. This might be an 
indication of the fact that at least in the case of relearning of 
probability patterns the feature of ‘being without exception,’ 
per se, could be helpful. This issue was discussed in the 
preceding section on original learning but answered negatively. 
As was suggested above, the amount of ambiguity added by 
the reversal of the training might be bearable in case of a 
fairly clear-cut pattern but might break down orientation in 
case of a high degree of ambiguity intrinsic to the probability 
pattern itself. 

In Fig. 3 the differences mentioned can be seen graphically. 
Whereas the middle section of the curve for original learning is 
approximately a straight line, the relearning curve shows a 
dip at the point 75 : 25. Since the middle part of the probability 
difference curve is a straight line and the middle part 
of the probability ratio curve shows a continuous downward 
trend, the deepening mentioned might be due to the effect of 
the two-fold ambiguity of condition 75 : 25 superimposed on 
the effect of the probability ratio. 

The results of the reversed ‘danger’ experiments are shown 
in Table 7 and Fig. 2. Relearning is significant for both 
conditions. As in the case of original learning, training type 
100 : 50 (that is, 50:0 in terms of punishment) shows the 
better results. As can be seen from Fig. 3, on the whole, 
relearning is much better in the case of the danger experiments 
TABLE 7 


RESULTS OF DANGER EXPERIMENTS AT THE END OF REVERSED TRAINING 
Data given in analogy to Tables 5 and 6 


Training Type in Terms of Reward                        | 75 : 25 | 100 : 50 
Training Type in Terms of Punishment                    | 25 : 75 | 0 : 50 
Corresponding to Table 5 
Corresponding to Table 6 
Percentage difference original vs. reversed training    | 46.9    | 73.9 

than in the case of the main experiments, though learning had 
been stopped at a point of approximate equality of the training 
effect. This might be due either to a greater efficiency of the 
danger set-up, as such, or to the smaller amount of fixation of 
the original training due to a smaller amount of repetition. 
It seems plain, however, that the principle of emphasis as 
outlined above holds not only for learning but also for 
relearning. 


(7) Individual Patterns of Experience.—As an objection to our evaluation of the
results it could be said that the probability patterns used in our training are defined in 
purely objective terms, that is, from the standpoint of the possibility of success and 
failure as arranged by the experimenter and not by the pattern of success and failure as 
experienced by the animal. This latter pattern might vary from animal to animal 
according to the individual choices made. 

So, for instance, a rat trained after the scheme 100 : 50 might accidentally never 
choose the less profitable side ‘50’ except when this side happens to be rewarded. Such 
an animal would be likely to become ‘optimistic,’ that is, to build up the hypothesis 
that whatever choice it makes would be successful. For this particular rat pattern 
100 : 50 would actually be a pattern 100 : 100. Though less likely than 100 : 50,
even patterns 75 : 25 or 67 : 33 could thus become effective as patterns 100 : 100 if
only the particular animal always would happen to choose the side rewarded in the 
respective trial. Other animals might equally well become thoroughly ‘pessimistic’ 
if they accidentally should choose the non-rewarded side in every single instance. In 
those cases the pattern would become, in the eyes of the rat, a 0 : 0 pattern. Numerous
other varieties would be possible. In particular, there is a certain chance that some 
animals trained after the scheme 100 : 50 would experience exactly the same pattern 
as some animals trained after the scheme 75 : 25 or 67 : 33. 
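
The distinction between the pattern arranged by the experimenter and the pattern met by the animal can be made concrete with a small tally. The following is only an illustrative sketch: the function name, the eight-trial schedule, and both choice sequences are hypothetical, and it counts first choices only, ignoring any correction run to the other side.

```python
def experienced_pattern(choices, rewards):
    """Tally (successes, visits) per side as the animal actually met them.

    choices: side chosen on each trial, e.g. "A" or "B".
    rewards: one dict per trial mapping side -> True if food was there.
    """
    tally = {}
    for side, trial in zip(choices, rewards):
        wins, visits = tally.get(side, (0, 0))
        tally[side] = (wins + int(trial[side]), visits + 1)
    return tally


# Hypothetical excerpt of scheme 100 : 50: side "A" is always baited,
# side "B" is baited on every other trial (trials 0, 2, 4, 6).
schedule = [{"A": True, "B": i % 2 == 0} for i in range(8)]

# An 'optimistic' animal that happens to visit "B" only when it is baited.
lucky = ["B", "A", "B", "A", "A", "A", "B", "A"]

# An animal that happens to visit "B" only when it is empty.
unlucky = ["A", "B", "A", "B", "A", "A", "A", "B"]

print(experienced_pattern(lucky, schedule))    # "B" succeeds on 3 of 3 visits
print(experienced_pattern(unlucky, schedule))  # "B" succeeds on 0 of 3 visits
```

Under the same objective schedule the first animal experiences what amounts to 100 : 100 and the second what amounts to 100 : 0, which is the kind of divergence the objection has in mind.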

A special situation with regard to the patterns experienced is also given in those 
cases in which an animal becomes fixated on one of the two sides. Experience then 
is limited to this side except in those cases in which it is not rewarded and in which 
the animal therefore has to turn to the other side. Except for training type 50 : 0 this
other side would then be experienced without exception as a success. Furthermore,
for animals fixated on the correct side the training would become exactly alike under
conditions 100 : 0 and 100 : 50.

To the objections raised the following reply is to be made. For all of the training 
types except 100: 50 the position of the final goal of each run is predetermined and 
becomes known to the animal in every instance. The two sides are complementary and 
there is never food on both sides. It is only the additional amount of negative in- 
formation given in the case of an unsuccessful choice which may vary from animal to 
animal. Furthermore, for all the training types except 100: 50 and 100:0 each 
animal is forced to have experiences on both sides during the course of the experiment 
and thus to give up strict fixation on one side. Finally, some of the instances men- 
tioned are constructions of extreme cases whose practical occurrence seems obviously 
to be highly improbable. 

In addition to these arguments, however, an empirical investigation concerning 
individual animals seemed to be desirable. Therefore a detailed analysis of the 24 
trials of the original training of all the individual animals used in our study was per- 
formed out of which the following results should be emphasized. 

As far as the interrelation of training groups 100: 0 and 100: 50 is concerned, 
there is fixation on the correct side throughout the whole course of the original training 
in 11 and 8 cases respectively. To the latter group (of 48) thereby 7 animals were 
added which were trained according to training plan 100 : 50 during the original train- 
ing but had to be replaced because of an error committed by the experimenter during 
the reversed training period. In addition to the 8 animals fixated on the correct side 
there were 6 animals in group 100: 50 which never found their choice of the less 
profitable side successful. There were, therefore, in group 100: 50 14 animals out of 
55 which received training which also would have been possible under condition 100 : 0.
As a compensation, there were 6 animals of the ‘optimistic’ type in group 100: 50 
which found food on the less profitable side in every instance they chose it. Comparing 
the improvement from the first eight to the last eight trials of the original training for 
these 20 animals, representing incomplete or deviating types of experience, with the 
improvement of the remaining 35 animals of this group, we find rather similar scores, 
namely .91 and .75 respectively. In contrast to that the analogous score for the 48
animals in group 100 : 0 is 2.15, and it must be remembered that in this latter group
there were 11 animals fixated on the correct side who therefore were unable to make 
any progress at all. This suggests that the deviating animals can well be included 
with the group representing, as a whole, training type 100 : 50. 

In groups 75 : 25 and 67 : 33 there were 5 and 3 animals respectively who were 
fixated on the correct side and 1 and 2 animals respectively who were fixated on the 
incorrect side. As was said above, however, all of the 96 animals in these two groups 
were forced to have experiences on the other side of the discrimination problem in one- 
fourth to three-fourths of the cases. This feature marks them off from the animals 
fixated on the correct side in groups 100 : 0 and 100 : 50. Furthermore, there were no
cases of ‘optimism’ or ‘pessimism’ suggested to the animals by the incidentals of their 
training history. In group 75 : 25, there were only 2 animals who experienced, out 
of the 24 trials, less than 6 failures (namely 4 and 5) and only 3 animals who experienced 
more than 16 failures (namely, 18, 18 and 19). Similarly, in group 67 : 33 there were 
only 2 animals experiencing less than 8 failures (namely 7 and 7) and no animals 
experiencing more than 16 failures. There were, furthermore, only 2 animals in group 
75 : 25 and none in group 67 : 33 who did not experience at least 2 failures on the cor- 
rect side in case they chose this side more than 8 times. And there were only 3 animals 
in group 75 : 25 and none in group 67 : 33 who did not experience at least one success
on the incorrect side in case they chose this side more than 2 times. It can be con- 
cluded that the patterns of experiences made by the individual animals are to a con- 
siderable extent uniform within the respective group and conform fairly well with the 
intended training types. These groups may therefore be taken, as a whole, as true 
representatives of the respective probability patterns. 

(8) Repeated Reversals.—Figure 4 shows the results of the repeated reversal training 
of the training group 100: 0. In contrast to Figs. 1 and 2 the results are expressed in 
absolute numbers of runs. Furthermore, in order to be able to compare graphically 
the successive periods, in each case of the reversal of training the curve is interrupted 
and drawn in such a way that the responses which were correct the foregoing day are 
now plotted as errors. An exception is made only for the first trial of each retraining 
period, where the result is plotted in terms of both the old and the new curve. The 
left part of the figure representing the main experiments is a repetition of the corre- 
sponding curve in Fig. 1. 

In comparing the starting points of the consecutive curves as emphasized by small 
squares, a definite downward trend can be observed. The animals seem to learn that 
the direction of training might be alternated. A general distrust of the stability, or 
‘conservation,’ of the particular distribution of success and failure seems to become 
established. At the same time there is an improvement from day to day of the success 
reached at the end of each training period of 8 trials starting from the last reversals. 
This is indicated by the encircled points. Both these features together are expressed 
by the increasing upward trend of the last unit of each curve representing the transition 
from the last trial to the first trial of the succeeding reversed period. 

The increasing adjustment to reversal of training also reveals itself in the fact that 
there is a tendency toward increasing steepness of the first link of each of the consecutive 
curves. In Fig. 4 the numbers inserted indicate the progress made from the first to 
the second trial of each phase. Taking the total training there is an increase from 1 on
the first to 21 on the last day. It seems as if it would become possible, after having 
made the rats acquainted with the situation, to ‘tell’ them, by means of the first trial, 
in a way almost as generalized as is human language, which side they should choose 
during the day in question. The rats have ‘learned to relearn,’ or to respond to the 
first few trials of a day as a ‘symbol’ for the conditions during the trials to come. A 
discussion of former findings on this problem is given by Krechevsky (4) who reaches 
a conclusion similar to ours. 


SUMMARY 


As a contribution to what might be called the ‘psycho-
physics of probability,’ five groups of 48 rats each were re-
warded on the two sides of a choice situation different pro-
portions of times (‘main experiments’). A training of 24 
trials failed to establish a discriminatory response in only one 
of the groups, for which the chances of reward on the two sides 
were 2/3 against 1/3 (group ‘67 : 33’). In contrast to that, 
groups ‘100: 50,’ ‘75:25’ and ‘50:0’ were significantly 
above the ‘threshold of probability’ in an increasing order. 
For the last of these groups, 50 : 0, the difference with control
group ‘100:0’ which represented the traditional unam- 
biguous type of training dropped below significance (‘thresh- 
old of certainty’). 

Discrimination increases with the difference of the prob- 
abilities of success on the two sides, a further influence being 
superimposed due to the ratio of probabilities (tendency 
toward a ‘Weber law of probabilities’). Additional experi- 
ments of the type 75:25 and 100: 50 introducing special 
punishment for each non-rewarded choice (‘danger experi- 
ments’) showed that the increase in discrimination goes with 
the increase of the ratio of the probability of ‘emphasis’ 
as given by punishment or by success, and not with the in- 
crease in the ratio of probability of success, per se. 

Analyses of the factors of isolated frequencies of reward 
or non-reward, of recency, and of the training histories of 
individual animals did not impair the validity of our inter-
pretations. The question as to the role of perfect insepara-
bility of a means-end connection as an additional emphasizer
could not be answered univocally as far as our animals were 
concerned. 

After a reversal of the probability patterns for 16 consecu- 
tive trials, group 75 : 25 dropped below threshold, showing 
as little relearning (or extinction) as group 67 : 33. Likewise 
the readjustment shown by group 50:0 dropped far below 
that of 100 : 0.

Repeated reversal of training type 100:0 showed in- 
creasing promptness of adjustment. 


(Manuscript received January 23, 1939) 
RETROACTION AND GAINS IN MOTOR LEARNING:
II. SEX DIFFERENCES, AND A FURTHER 
ANALYSIS OF GAINS 


BY C. E. BUXTON 


Swarthmore College 
AND 
D. A. GRANT 


University of Wisconsin


SEX DIFFERENCES IN GAINS AND INTERFERENCE


It was shown in a previous paper of this series (2)¹ that
activities interpolated in the learning of a pursuit task have
no retroactive effect, insofar as they do not seem to hinder
retention of the skill so as to make the subject, returning to the
pursuit rotor after a period of practice at something else,
do more poorly than he did previously. The presence of an
absolute loss of this sort is made impossible by large recoveries
from ‘work-decrement,’ or by maturation, or growth, as the
phraseology may be, which occur during periods of no rotor
practice, following first practice. Nevertheless, a ‘relative
retroaction’ may be demonstrated because certain interpolated
activities are capable of preventing as full a gain or facilitation
as will occur if reading fills the interpolated period.
(Certain other interpolated tasks produce positive transfer
to the resumed rotor series.)

The sexes differ in both original skill on these motor tasks 
and the amount of influence which practice and interpolated 
rest or practice periods exert; these differences will be evalu- 
ated first in this paper. The men exceed the women in initial 
and final level of pursuit rotor skill. This is shown in Table I. 
Most of the differences approach statistical significance or are 
significant. However, it is interesting that there seem to be 


1 All results reported here are based on data used for the preceding paper. 
no sex differences worthy of note on the interpolated
activities.²

Because of the differences in absolute scores for men and 
women, another type of analysis is necessary to show the rela- 


TABLE I
SEX DIFFERENCES IN PURSUIT ROTOR SKILL AND IN INTERPOLATED ACTIVITIES

Variables                          Mm     SEm    Mf     SEf    CR     Chances   Favoring

Initial Ability
Final five of first 15 trials
  on rotor, control groups         4.39   .41    2.90   .30    2.93   98        M
Same, meter groups                 4.00   .48    2.82   .49    1.72   91        ”
Same, maze groups                  4.33   .50    2.28   .40    3.20   99+
Same, spool groups                 4.32   .31    2.38   .33    4.37   99+

Final Ability
Final five rotor trials,
  control groups                   7.21   .47    6.32   .33    1.55   88        M
Same, meter groups                 6.24   .48    5.15   .59    1.44   85        ”
Same, maze groups                  7.69   .58    4.85   .56    3.52   99+
Same, spool groups                 7.91   .30    5.62   .52    3.81   99+       ”


Interpolated Tasks
Last five meter trials             7.07   .31    7.17   .19
Last five spool trials             27.4   .7     27.2   .7     .20    14        M


N = 31, male control; 30, female control; 20 each for all other experimental 
groups. 

Maze times and errors are not compared for the sexes because of the necessarily 
arbitrary nature of selection of trials for the comparison—the number of trials varied 
from subject to subject. 


tive changes which occur in each sex during original practice 
and interpolated periods at differing activities. Shown in 
Table II are the mean ratios of gain during the first 15 trials 
on the rotor, when the difference between the last five trials 


2 Differences on the first task might occur because of differences in adjustment to
the situation. It frequently is true that women are more disturbed than men when 
faced with a pursuit rotor, and the differences in scores might disappear after these 
first adjustment problems were overcome. However, the greater gain made by men 
during the first practice on the rotor would carry over to keep them superior to the 
women after the interpolated period. 
and the first five is the numerator of the ratio, and the first
five trials are the denominator. 
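
The ratio of gain just described can be sketched as follows. This is an illustration only: the text does not say whether the ratio was formed for each subject and then averaged or formed from group means, so the function below handles a single subject, and the function name and the two sets of scores are hypothetical.

```python
from statistics import mean


def gain_ratio(first_five, last_five):
    """Ratio of gain: (mean of last five - mean of first five) / mean of first five."""
    base = mean(first_five)
    return (mean(last_five) - base) / base


# Hypothetical contact times in seconds per trial for two subjects who
# improve by the same absolute amount (3 seconds) from different starts.
low_start = gain_ratio([1.0, 2.0, 2.0, 2.0, 3.0], [4.0, 5.0, 5.0, 5.0, 6.0])
high_start = gain_ratio([3.0, 4.0, 4.0, 4.0, 5.0], [6.0, 7.0, 7.0, 7.0, 8.0])

print(low_start)   # 1.5
print(high_start)  # 0.75
```

Because the first five trials form the denominator, an equal absolute gain yields a larger ratio for the subject with the lower starting level.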

It is consistently true, except in the case of the spool 
packing interpolated group, that the women gain proportion- 
ately more from their lower starting point during the first 
15 trials, even though they do not gain enough, as shown in the 
previous table, to bring them up to the absolute level of the 
men. Furthermore, women gain proportionately more during 
the rest period for the control groups, and profit propor- 


TABLE II

RATIOS OF GAIN OF THE LAST FIVE ROTOR TRIALS IN THE FIRST FIFTEEN OVER THE
FIRST FIVE TRIALS; RATIOS OF GAINS FOR RETURN FIVE TRIALS
OVER LAST FIVE BEFORE INTERPOLATED PERIOD

Variable        Mm     SEm    Mf     SEf    CR     Chances   Favoring

Gain During First Rotor Series
........        1.80   .29    3.25   .90    1.54   88        F
........        2.08   .31    3.88   1.09   1.58   88
........        2.10   .74    2.14   .42    .05    4
........        1.50   .36    3.10   .59    2.32   98        ”

Gain, Return Series on Rotor Over Last Five of First Fifteen
........        .82    .10    1.50   .21    3.14   99+       F
........        1.04   .15    2.40   .53    2.47   98
........        .95    .14    2.5?   .49    2.27   98
........        .98    .20    1.24   .87    .62


tionately more from the maze and spool packing practices. 
The meter inhibited full facilitation for both the women and 
men, but apparently not as much for the men. 

In general, then, the women present a picture of somewhat
greater amounts of change in performance, and of greater
susceptibility to changes in experimental routine (this statement
is reënforced by examination of the fatigue curves in Fig. 1,
discussed in the next section). This conclusion must be
regarded as tentative.
Fig. 1. Complete learning curves for rotor-interpolated (fatigue) groups. Abscissae
are average lengths of contact per 17 sec. trial.


FATIGUE AS A DIFFERENTIAL DETERMINER OF GAINS 


Doré and Hilgard (3) have criticized Snoddy’s (5) inter-
pretation of gains made during rest periods from a mirror
tracing task. Snoddy postulates two opposed types of 
growth, primary and secondary, to account for his results, 
whereas Doré and Hilgard believe it is possible to explain such 
gains in the conventional terms of work-decrement and 
recovery. It is easily observed that mirror-tracing or pursuit 
learning in themselves provide a very challenging type of 
task and therefore a high level of motivation. The conse- 
quent expenditure of effort may prove tiring to subjects even 
though the actual time spent in practice may be measured in 
only seconds or minutes. This fact lends credence to Doré 
and Hilgard’s viewpoint in the minds of the present writers, 
but we now wish to examine it further. 

An additional experiment, involving the same general 
procedure as that described in the first paper of this series (2), 
was performed, upon what were definitely ‘fatigue’ groups of 
subjects. They, like the other experimental groups in this 
series, began with 15 trials on the pursuit rotor, rested ap- 
proximately a minute, but then did 15 additional trials, rested 
a minute, and returned for five more trials. The second series
of 15 trials definitely tired the subjects. Fifteen men and 15 
women, selected in the same manner as those for the first 
experiments, except that they were a slightly older summer 
school group, served as subjects, and their results are now 
compared with the control groups of those experiments. 

It is shown in Table III that a sampling error prevented 


TABLE III

Group          N     M      SEm    CR     Chances   Favoring

Initial Ability of Control Groups and Special Fatigue Groups
Men
  Control      31    4.39   .41
  Rotor        15    2.98   .57    2.01   96        C
Women
  Control      30    2.90   .30
  Rotor        15    1.85   .26    2.63   99

Ability on Return to the Rotor for Last Five Trials
Men
  Control      31    7.21   .47
  Rotor        15    6.78   .58    .58    44
Women
  Control      30    6.32   .33
  Rotor        15    6.94   .57    .94    66        R


close matching of control with experimental group for both 
sexes, on initial ability. In both cases the control group was 
quite superior. However, we may compare initial critical 
ratios with those for the last five trials (the second return) 
after equal amounts of time, one group in each case having 
had additional practice in the meantime, and we note that 
both rotor interpolated groups gained more than the controls, 
although the male experimental group did not gain enough 
to exceed its controls. 
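
The critical ratios being compared here can be checked with a short calculation. The sketch below assumes the conventional formula for two independent groups, the difference between means divided by the square root of the sum of the squared standard errors; the paper does not state its formula, and the function name is hypothetical. The control group's standard error for initial ability (.41) is taken from Table I.

```python
import math


def critical_ratio(mean_a, se_a, mean_b, se_b):
    """Difference between two independent means over the SE of that difference."""
    return abs(mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


# Men, initial ability: control group vs. rotor-interpolated group.
men_initial = critical_ratio(4.39, 0.41, 2.98, 0.57)

# Men, last five trials on return to the rotor.
men_return = critical_ratio(7.21, 0.47, 6.78, 0.58)

print(round(men_initial, 2))  # 2.01
print(round(men_return, 2))   # 0.58
```

Both values agree with the critical ratios tabled for the men, which suggests that the assumed formula is at least close to the one used.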




An examination of Fig. 1 in addition to Table III shows 
the following facts: almost equal gains are made during the 
first rest period by both groups; the female rotor interpolated 
group shows somewhat greater variability in performance and 
a somewhat greater amount of fatigue during the added prac- 
tice on the instrument; during the second rest period, the 
women make a greater gain than do the men. Doré and 
Hilgard are thus to some extent supported in their contentions, 
for the curves look as though the women were gaining in 
‘latent’ skill during the fatigue period, and needed only a 
brief respite to permit them to show their acquired skill. 
Recovery from work decrement factors would be important 
in such a case. 

It would seem an open question, still, as to whether the 
total amount of gain shown is simply recovery from fatigue 
and like influences, for after only 15 trials (Fig. 1), at the 
beginning of our experiments, when the amount of fatigue is 
probably not very great, obvious gains are made in a short 
rest period. Furthermore, the gains in motor learning have 
been shown to go on over relatively long periods of time (e.g.,
by Travis, 6), and the fatigue-recovery factor as such is 
somewhat difficult to conceive as functioning through these 
longer periods, even though it might be quite important in a 
short experiment (cf. Renshaw, 4). (One could of course 
question that any other process should continue. This point 
will be discussed in the third paper of this series.) A third 
point is that in some of the experimental groups of our first 
experiments (2), subjects were kept busy in the interpolated 
period at tasks which did not correlate at all closely with rotor 
performance but which seem to involve the same general 
musculatures and types of coordinations. These interpola- 
tions therefore should produce effects somewhat similar to 
those of the rotor, in the way of fatigue, but nevertheless 
approximately the same gains are made by such groups (e.g., 
the male meter interpolated) as by control groups, being 
usually somewhat less than the latter. (It is implied in such 
an argument against the work-decrement explanation of gains 
that, if there is positive transfer from the interpolated to the 
resumed task, there should be at least some sign of correlation
between them, not the chance size of our coefficients in the 
instances mentioned here.) 

The fundamental problem of just why spaced practice 
should be superior to massed practice is still not satisfactorily 
solved. If recovery from fatigue is only a partial determiner, 
other factors must be found. Reminiscence may be listed as a 
factor, but it perhaps is time to define this concept more 
carefully. As it now is generally used, it is a rather opera- 
tional sort of term, signifying any kind of gain in performance, 
made without rehearsal. The implicit assumptions neverthe- 
less seem to be that recovery from fatigue is not what is meant, 
that changed conditions at the time of recall so as to improve 
performance are not what is meant (although Woodworth, 7, 
has rightly pointed out the significance of this aspect of remi- 
niscence experiments), but that there is some sort of spontane- 
ous change going on in the central nervous system of the 
organism which results in improved performance. We would 
propose that the use of the term be limited to the last named 
phenomena. To put the problem in the language of traces, 
whatever changes occur in traces, once they are laid down,
are a function of both the changes in the organism in its recovery
from fatigue, in relaxation at time of recall, etc., and the dy- 
namics of traces themselves considered apart from extrinsic 
determining conditions. We propose that reminiscence be 
used to describe the possible spontaneous changes in the traces 
which might occur by the very nature of the nervous tissue 
in which they are laid down. This does not dismiss the prob- 
lem. Indeed, it makes the nature of reminiscence itself a 
research problem, as Bunch (1) has emphasized. The laws 
governing gains in a rest period thus should be of several 
kinds, dealing with, for example, amount and kind of change 
in traces with recovery from fatigue, amount and kind of 
change in traces with improved conditions of recall, such as 
relaxation, confidence, feelings of familiarity, etc., and amount 
and kind of change in traces themselves with the passage of 
time when the above or other factors are not significant or are 
cooperative. The validity of the hypothesis that traces may 
of their own nature change to produce improved performance
(which we propose to call reminiscence) is not yet established,
but there are many lines of evidence for it. It should be stated
that, in the above proposal, we do not consider the function 
of reminiscence, if there be such, as something ‘split off’ 
from the rest of the functions of the organism. On the con- 
trary, one of the first possible weaknesses of our proposed 
definition of reminiscence may be that one cannot think
of ‘spontaneous’ changes in traces without considering ex- 
trinsic determiners of these changes. The results of our 
experiments indicate that fatigue is not necessarily important 
in determining amounts of gain, although improved conditions 
at time of recall may be something of a factor. Therefore the
possibility of reminiscence as we have defined it above is
definite in our results.


RELATION OF AMOUNT OF GAIN TO THE LEVEL
OF ABILITY


The question arises as to whether an individual initially 
superior on the pursuit rotor will gain more or less in a rest 
period than one who is initially poor (the word initially here 
meaning: at the end of a first practice period). In this con- 
nection the control groups of subjects of the first paper in 
this series (2) were arbitrarily divided into high and low halves 
on the basis of the last five trials in the first 15 on the rotor. 
The complete learning curves for these two groups (in each 
sex) are shown in Fig. 2. The values which are of particular 
interest in this figure are summarized in Table IV. In terms 
of absolute gain during the 14 minute rest period, the high and 
low groups in each sex are approximately equal. However, 
when percentages of improvement are computed, we see that 
proportionately more ‘spontaneous’ improvement is made 
by the poorer subjects.³, ⁴ (Referring again to the work-


3 These values are not given more elaborate statistical treatment because the
means are simply a function of the arbitrary dividing line between ‘good’ and ‘poor’ 
groups. 

4 It is possible that the proportionately greater gains made by the women during
the rest period are a function of their lower position in ‘the’ learning curve. In other
words, women and men of equal ability at a given time may gain an equal amount
if given equivalent rest periods, unless fatigue plays a differential role. This point
may be studied experimentally by equating groups of male and female subjects under
various conditions.
TABLE IV
GAINS MADE BY HIGH AND LOW HALVES (IN LAST FIVE OF FIRST 15 TRIALS) OF
CONTROL GROUPS AFTER REST PERIOD (AV. TIME IN SECS. PER TRIAL)

                Before Rest   After Rest   Gain in Secs.   Percent Gain
Men
  High:                       10.1         2.85            39
  Low:          2.11          4.78         2.67            127
Women
  High:         4.80          7.94         3.14            65
  Low:          1.31          4.66         3.35            256

[Figure: learning curves over trials for the high and low halves of the male and female control groups.]

Fig. 2. Gains made during the interpolated period by high and low halves of subjects
for each sex.


decrement-recovery interpretation, it seems questionable to 
interpret the differences between the shapes of the curves 
for ‘good’ and ‘poor’ groups in Fig. 2 simply on the basis 
of greater fatiguability of poor subjects. It well may be that
other principles, whether Snoddy’s or not, are operating.) 
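
The percentages of improvement in Table IV appear to be the gain in seconds expressed as a percentage of the score before the rest period. A minimal sketch under that assumption follows; the function name is hypothetical, and the scores are the Table IV values for the groups whose before and after scores are both legible.

```python
def percent_gain(before, after):
    """Gain over the rest period as a percentage of the score before rest."""
    return 100.0 * (after - before) / before


# (before, after) average contact times in seconds per trial, from Table IV.
table_iv = {
    "men, low": (2.11, 4.78),
    "women, high": (4.80, 7.94),
    "women, low": (1.31, 4.66),
}

for group, (before, after) in table_iv.items():
    print(group, round(after - before, 2), round(percent_gain(before, after)))
```

The rounded results (127, 65 and 256 percent) match the tabled percentages, and they show how nearly equal absolute gains of about 3 seconds become very unequal proportional gains once they are divided by different starting levels.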


SUMMARY 


1. The sexes were compared as to their performance upon 
pursuit and other motor learning tasks, and as to the differ- 
ential effects of interpolated periods filled with other motor 
activities or with reading upon retention of skill on the 
original task. 

2. Males exceed females in initial and final ability at pur- 
suit in the present experiment (initial meaning the last five 
of the first 15 trials, and final meaning five return trials after 
14 minutes of interpolated activities) without regard to what 
activity the interpolated period contains. 

3. Women show proportionately greater improvement on 
the original task during the interpolated period, for the control 
group, and possibly more interference in the experimental 
groups (particularly the meter-interpolated group). 

4. In additional experiments, planned like those reported 
in a previous paper, women showed somewhat greater fatigua- 
bility in continuous practice on the pursuit rotor, and made 
proportionately greater gains in the one minute rest period 
between the second series of 15 trials on the rotor and a final 
series of five trials. The results of this experiment are be- 
lieved to lend partial support to Doré and Hilgard’s explana- 
tion of Snoddy’s results on the basis of recovery from work 
decrement. 

5. Subjects ending an initial practice period at a low level
gain proportionately more during a rest period than those 
having high scores. 

6. Arguments are brought forward to show that recovery 
from work decrement (e.g., fatigue) is but a partial explanation 
of the gains made in motor learning. 

7. The factor of reminiscence is proposed as an added 
(but not the all-inclusive) determiner of the gains. A defini- 
tion of reminiscence is suggested as part of this proposal. 


(Manuscript received January 9, 1939) 
THE EFFECT OF SPACED LEARNING ON THE
CURVE OF RETENTION¹


BY LEO F. CAIN 


San Francisco Junior College 
AND ROY DE VERL WILLEY 
San Jose State College 


The present study attempts to offer a partial explanation 
of the factors which are significant in producing the difference 
in retention which follows the learning of a given amount of 
material in three periods of practice as compared with learning 
the same amount in one period. 


Apparatus and Procedure.—Subjects learned twelve three-letter nonsense syllables 
by distributed and massed practice. The anticipation method was used throughout in 
that the subject was required to anticipate each syllable by spelling it aloud within the 
exposure period of the syllable preceding. No credit was given for the correct spelling 
of parts of the syllables. One successful anticipation of the total series constituted the 
final criterion of learning. 

The apparatus consisted of a mechanical memory drum. Syllables were exposed 
at two-second intervals. A six-second interval was given between each trial. Two 
series of nonsense syllables were selected from Melton’s lists and all subjects had a 
practice period before attempting actual learning. Careful instructions were given 
preceding the practice period and the learning periods. The instructions explained 
the nature of the procedure and the importance of such factors as reporting promptly, 
not discussing the experiment with others, and not attempting to recall the syllables 
between the experimental sessions. Before the relearning periods, it was emphasized 
that every possible effort should be made to recall as many syllables as possible on the 
first trial. The complete instructions are on file at the Department of Psychology, 
Stanford University. 

The subjects consisted of graduate and undergraduate students from Stanford 
University. Six groups of ten subjects each were used with the exception of Group VI 
where the results for nine subjects only were treated. The learning of one subject in 
this group was so atypical as to justify his exclusion.²

The groups were divided as follows: Distributed Practice Groups (Groups I, II, and
III). These groups met a criterion of 6, 9, and 12 items on successive days. Group I
relearned to 12 syllables after one day, Group II after three days and Group III after
seven days. Massed Practice Groups (Groups IV, V, VI). These groups met the
criterion of 12 syllables on the first day. Group IV relearned after one day, Group V
after three days, and Group VI after seven days.


1 The experiments were suggested by Professor E. R. Hilgard and were performed
under his supervision in the Stanford University Psychological Laboratory.

2 The range of learning trials for other subjects was from 16 to 59. He required
105 trials.


RESULTS 


In a comparison of the trials required for learning by dis- 
tributed and massed practice, there was very little difference 
in the number of trials required for mastery under the two 
conditions of learning. The total mean trials for the distributed
practice groups for learning to the criterion was
35.6 ± 2.23 (mean and standard error) and for the massed
practice groups was 33.5 ± 2.01. The amount of recitation
for both distributed and massed practice was so similar that 
total practice may be considered constant for the two condi- 
tions of learning. 
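
The statement that total practice was nearly constant can be checked against the reported means. The following is a minimal sketch, assuming that the two sets of groups are independent, so that the standard error of the difference is the root of the summed squared standard errors. The article does not report such a test; the function name is illustrative only.

```python
import math

# Mean trials to criterion and standard errors, as reported in the text.
distributed_mean, distributed_se = 35.6, 2.23
massed_mean, massed_se = 33.5, 2.01


def ratio_of_difference(m1, se1, m2, se2):
    """Difference between two independent means divided by its standard error."""
    se_diff = math.sqrt(se1 ** 2 + se2 ** 2)  # assumes independent groups
    return (m1 - m2) / se_diff


ratio = ratio_of_difference(distributed_mean, distributed_se, massed_mean, massed_se)
print(f"difference = {distributed_mean - massed_mean:.1f} trials, ratio = {ratio:.2f}")
```

Under this assumption the ratio comes to roughly 0.7, far below the value of 2.9 which the article elsewhere treats as satisfactory.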

TABLE I

                         RECALL SCORES (PERCENT)

Groups        Interval   Distributed   Massed     Difference   S.E. diff.   t*
                         Practice      Practice
I and IV      1 day      75.9          48.3       27.6         7.6          3.6
II and V      3 days     68.3          21.6       46.7         9.3          5.0
III and VI    7 days     66.6          20.3       46.3         10.1         4.6

                         SAVING SCORES (PERCENT)

Groups        Interval   Distributed   Massed     Difference   S.E. diff.   t*
                         Practice      Practice
I and IV      1 day      92.1          78.9       13.2         7.4          1.8
II and V      3 days     80.5          71.2       9.3          6.3          1.5
III and VI    7 days     86.0          73.9       12.1         7.6          1.6

* Computed by Fisher’s method for small samples.


Table I shows the differences in mean scores between the
distributed and massed practice groups after intervals of one,
three and seven days. The recall scores are plotted in Fig. 1.
It will be noticed that after one day the distributed practice
group was able to recall 75.9 percent of the series, while the
massed practice group was able to recall only 48.3 percent.
The difference of 27.6 percent is statistically significant as it
shows a ‘t’ value of 3.6.³ The spaced learning group held
this advantage after every interval of testing. In fact, there
is a large increase in advantage for this group at the three-day
interval, the difference here being 46.7 percent. This difference
was sustained by the group tested after an interval
of seven days where the difference was found to be 46.3 percent.
Both of these differences are statistically reliable. The differences
in the saving or relearning scores are not so striking,
although the higher percentages are consistently in favor of
the distributed practice group. The values are plotted in
Fig. 2.

Fig. 1. The curves of retention obtained from the mean recall scores at 1, 3,
and 7 day intervals following the memorization of 12 nonsense syllables by distributed
and massed practice.
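
The ‘t’ values of Table I can be recovered from the other columns. The sketch below assumes that each ‘t’ is simply the difference divided by the standard error of the difference, which is a reading of the table and not a procedure stated in the article. The threshold of 2.9 is the value the article describes as satisfactory for samples of this size.

```python
# (interval, difference in percent, S.E. of the difference, reported t)
recall_rows = [("1 day", 27.6, 7.6, 3.6), ("3 days", 46.7, 9.3, 5.0), ("7 days", 46.3, 10.1, 4.6)]
saving_rows = [("1 day", 13.2, 7.4, 1.8), ("3 days", 9.3, 6.3, 1.5), ("7 days", 12.1, 7.6, 1.6)]

CRITICAL_T = 2.9  # value treated in the article as satisfactory


def t_ratio(difference, se_difference):
    """Assumed definition: difference in units of its standard error."""
    return difference / se_difference


for label, rows in (("recall", recall_rows), ("saving", saving_rows)):
    for interval, diff, se, reported in rows:
        t = t_ratio(diff, se)
        verdict = "reliable" if t >= CRITICAL_T else "not reliable"
        print(f"{label:6s} {interval:6s} t = {t:.1f} (reported {reported}) -> {verdict}")
```

On this reading all three recall differences exceed 2.9 and none of the saving differences does, which agrees with the statement that the saving scores are less striking.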

Further study of the data reveals an interesting trend in
regard to the overnight changes when twenty-four hours
elapsed between successive learnings and recalls. The distributed
practice group showed a consistent day-to-day
increase in the percentage of syllables recalled when, in the
case of partial learning, the criterion is based upon the total
syllables learned in the previous practice period. This
phenomenon is particularly apparent during the partial learning
periods and continues to exist for the final recall period of
Group I. The relevant data are presented in Table II.


3 After Fisher’s method for small samples. For a sample of this size a ‘t’ of 2.9
is considered satisfactory.

Fig. 2. The curves of retention obtained from the mean relearning (saving)
scores at 1, 3, and 7 day intervals following the memorization of 12 nonsense syllables
by distributed and massed practice.


TABLE II

SHOWING THE PERCENTAGE OF RECALL OF CRITERION AFTER ONE, TWO AND THREE
TWENTY-FOUR HOUR INTERVALS

                         Group I   Group II   Group III   Group IV
After first period       52.7      38.7       42.5        48.3
After second period      60.6      59.7       73.2
After third period       75.9

Group I, after a twenty-four hour interval, could recall
52.7 percent of the six syllables learned at the first session.
Following the second session, this group could recall 60.6
percent of the nine syllables learned on the previous day and
following the third session could recall 75.9 percent of the
criterion of twelve syllables.
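
Because the criterion rises from session to session, the percentages for Group I refer to different numbers of syllables. The sketch below converts each percentage into a mean number of syllables recalled. It takes the first entry as 52.7 percent, and the function name is illustrative and not taken from the article.

```python
# (criterion in syllables for the preceding session, percent recalled after 24 hours)
group_i = [(6, 52.7), (9, 60.6), (12, 75.9)]


def mean_syllables_recalled(criterion, percent):
    """Convert a percentage of the criterion into a mean count of syllables."""
    return criterion * percent / 100.0


for session, (criterion, percent) in enumerate(group_i, start=1):
    syllables = mean_syllables_recalled(criterion, percent)
    print(f"session {session}: {percent}% of {criterion} syllables = {syllables:.1f}")
```

Expressed in this way the day-to-day gain is larger than the percentages alone suggest, since both the criterion and the proportion recalled increase together.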

This same daily increase is true of Group II and Group III 
in the specific instances in which there was a twenty-four hour 
interval between learning and recall. Group IV, which 
represents the only condition of massed practice in which 
there was a twenty-four hour interval between learning and 
recall, could recall only 48.3 percent after the twenty-four 
hour interval. It should be especially noted that Group IV, 
which learned by massed practice and had its first recall after 
twenty-four hours, had a similar recall score to that of the 
distributed practice groups at their first period of recall. 

In other words, each day’s practice in recall increases the 
recall score. Thus, it appears that practice of recall is ac- 
companied by an increased percentage of recall. It can be 
postulated from the evidence here presented that such practice 
of recall contributes, in part at least, to the advantage of dis- 
tributed practice in recall. 

Turning to a brief consideration of the relearning scores, 
it is noted that there is not the marked difference here be- 
tween the distributed and massed practice groups that is 
found in the case of the recall scores. This result is not 
surprising when the data are carefully considered. As has 
already been shown, similar numbers of trials were required 
for original learning by both methods. Since relearning is 
tested similarly for both groups, and both had had equal 
learning practice, little difference in relearning scores would 
be expected. The fact that spaced practice groups had higher 
saving scores can be attributed to the first trial in each recall 
period. As the distributed practice group had two additional
experiences of recalling first trials as compared with the
massed practice group, this advantage is reflected in the
saving score.

The data show that the curve of retention is not a con- 
stant function of the passing of time. They confirm Luh’s (2) 
finding that the curve of retention varies with the conditions 
of learning and recall. 

Many hypotheses have been offered to account for retention
and for the superiority of distributed over massed
practice. Rehearsal, fatigue, perseveration, and removal of
inhibition have been proposed. Hovland (1) discusses these 
in some detail. However, as far as the present data are 
concerned, practice in recall is seemingly an important factor. 
The fact that the groups which worked under the conditions 
of spaced learning had more practice in recall is largely 
responsible for the high retention scores. 


SUMMARY AND CONCLUSIONS 


1. This study presents and interprets data which show a 
relationship between conditions of learning and conditions 
of retention. Fifty-nine subjects learned a list of twelve 
nonsense syllables by massed and distributed practice. Re- 
call was tested following both methods after intervals of one 
day, three days and seven days. The anticipation method 
was used throughout and the final criterion was one correct 
anticipation of twelve syllables. 

2. The results show that although there was very little 
difference in the number of trials required for mastery under 
the two conditions of learning, the retention which followed 
distributed practice was higher. This was much more evident 
when retention was tested by recall than by relearning, 
although both methods showed differences consistently in 
favor of distributed practice. 

3. A consideration of the overnight changes shows a pro- 
gressively increasing recall score with each successive attempt. 
As the distributed practice groups had opportunity for more
attempts at recall, this opportunity gave them a distinct
advantage over the massed practice groups.

4. Practice of recall is important in the explanation of the 
superiority of distributed over massed practice as obtained 
under the conditions of the present study. 


(Manuscript received January 23, 1939) 
THE EFFECT OF MILD ANNOYANCE UPON THE
LEARNING OF VISUAL FORMS


BY HERBERT GURNEE 


Western Reserve University 


It is known that annoying stimuli such as electric shock
may facilitate learning when given for right responses in maze
situations.¹,² Evidently the effect is a function both of the
degree of shock² and of its temporal relation to the act of
choice.³,⁴,⁵,⁶ Thus while mildly annoying shock for right
responses tends to facilitate, moderately annoying shock for 
right responses tends to inhibit the elimination of errors; 
and shock after the choice-point accelerates, whereas shock 
before or at the choice-point retards learning. This facilita- 
tion creates a significant psychological problem, since it runs 
counter both to the expectations of common opinion and to 
the requirements of the law of effect (1913 formulation). A 
satisfactory solution must wait upon further investigation 
of the extent and the conditions of its occurrence. Since 
annoyance has thus far been used in situations which involved 
a necessary element of choice, it would seem advisable as a 
next step to try its effect on the learning of material in which 
choice is not a factor. The present article describes such an 
experiment. Specifically, the problem was to observe the
effect of electric shock upon the learning of visual forms, with
the degree of learning measured by the recognitive method.


1 Tolman, E. C., Hall, C. S., and Bretnall, E. P., A disproof of the law of effect
and a substitution of the laws of emphasis, motivation, and disruption. J. Exp.
Psychol., 1932, 15, 601-614.

2 Gurnee, H., The effect of electric shock for right responses on maze learning in
human subjects. J. Exp. Psychol., 1938, 22, 354-64.

3 Muenzinger, K. F., Electric shock for correct responses in the visual discrimination
habit. J. Comp. Psychol., 1934, 17, 267-277.

4 Muenzinger, K. F., and Wood, I., The function of punishment as determined
by its temporal relation to the act of choice in the visual discrimination habit. J.
Comp. Psychol., 1935, 20, 95-106.

5 Hamilton, J. A., and Krechevsky, I., Studies in the effect of shock upon behavior
plasticity in the rat. J. Comp. Psychol., 1933, 16, 237-253.

6 Fairlie, C. W., Jr., The effect of shock at the moment of choice on the formation
of a visual discrimination habit. J. Exp. Psychol., 1937, 21, 662-669.


Apparatus and Procedure.—The stimuli were forty meaningless, irregular designs 
drawn in black ink on 5” by 3” white cards. The figures were closed and ranged in
area from 20 to 30 square cm. Twenty of the designs were presented initially to the
subject, and these were then mixed with the twenty others and shown for recognition.
The recognition method was used because it is most adapted to problems of this 
nature; the method of reproduction tends to introduce more or less uncertainty in the 
determination of right and wrong responses. The same designs were used in the 
initial presentation for all subjects; and the same order was used throughout, so that 
mutual interferences and reinforcements among patterns would be constant. 

Each card appeared for four seconds in the aperture of a white cardboard frame, 
placed on the table between the subject and the experimenter. Presentation was 
manual, with time controlled by a metronome. There was a four-second interval
between successive cards. In the final presentation this was lengthened to six seconds
where necessary, and was used to record the subject’s response, ‘yes,’ ‘no,’ or ‘doubtful.’
Doubtful responses were discouraged.

The electric shock was given by the secondary of a Dubois-Reymond variable
inductorium, operated from three dry cells. A piece of brass tubing held in each hand
constituted the electrodes. The setting of the inductorium was determined from the
judgments of a preliminary group of nine subjects. By the method of limits a value
was obtained for each of these subjects which was reported mildly annoying, and from
these values a setting was computed which was at least mildly annoying to two-thirds
of the subjects.

The regular subjects were 87 male undergraduates. They were divided into three 
groups of 29 members each. One group comprised the controls and received no shock. 
A second group received shock with the first ten of the twenty initial cards, and the 
third group received shock with the last ten cards. This procedure was followed in 
order to get a measure of the degree of similarity of the groups, since it has been shown 
that groups of this size cannot be assumed to be equal. 

The shock came during the last two of the four seconds during which the particular 
card appeared. This gave the subject two seconds in which to respond before annoy- 
ance occurred. Such a procedure seemed advisable in the light of Muenzinger’s 
discovery that reinforcement does not occur unless shock comes after an animal’s 
response has begun. 

After the twenty initial cards were presented, the subject was asked to rate the 
shock on a five-point scale consisting of the following steps: pleasant, neutral, mildly 
annoying, moderately annoying, extremely annoying. 


RESULTS 


The crucial quantitative results are given in the adjoining 
table. The values are the averages and the standard devia- 
tions of the number of correct recognitions of the initial twenty 
stimuli. Reactions to the twenty inserted stimuli are omitted, 
and wrong and doubtful reactions to the initial stimuli will 
be mentioned later. The group labeled Experimental I
received shock on initial stimuli one to ten; the group labeled
Experimental II on initial stimuli eleven to twenty. Nine 
of the experimental subjects, four from I and five from II, 
reported the shock pleasant and their results are accordingly 
not included in the table. The critical ratios (C. R.) are in 
terms of standard errors and are for comparisons of each ex- 
perimental with the control group. A minus sign indicates a 
lower, and a plus sign a higher average response than for the 
control group, that is, a difference in favor of the experimental 
group. 

TABLE I

AVERAGES AND STANDARD DEVIATIONS OF CORRECT NUMBER OF RECOGNITIVE
RESPONSES TO INITIAL STIMULI

                       Stimuli 1-10         Stimuli 11-20
                       Av.       S.D.       Av.       S.D.
Control Group          7.389     1.975      6.483     1.584
Experimental I         7.083     1.222      7.750     1.300
    C. R.              -0.71                +3.25
Experimental II        8.364     1.153      6.045     1.261
    C. R.              +2.37                -1.15


It will be seen that the shock induces a loss of correct
responses in both experimental groups. A puzzling circumstance
is the clear superiority of the two latter over the control
group on the shockless stimuli. One would ordinarily interpret
this difference as an indication of natural superiority of
the experimental S’s. But another explanation is possible.
It may be that after-effect of the shock in the first group, and 
expectation of the shock in the second group, tended to pro- 
duce a reinforcement; thus the shock may have induced an 
increased alertness which irradiated to reinforce impression 
of the shockless stimuli. This matter needs experimental 
analysis; and the factor of expectation ought especially to be 
isolated. It may well be, in certain respects at least, that 
anticipation of punishment is more significant for learning 
than the experience of punishment itself. 
In any event, it seems clear that the shock tended to inhibit
the learning of the stimuli which it actually accompanied.
This is supported by comparisons between experimental 
groups. On the first ten stimuli the shockless experimental 
group has a definite advantage over the shocked group, the 
critical ratio being 3.77. On the last ten stimuli the advan- 
tage of the shockless group is even greater, the critical ratio 
being 4.61. Since the shocks were reversed for the two 
groups, this difference cannot be a result of sampling. 
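
The critical ratios can be approximated from the averages and standard deviations in Table I. The sketch below assumes the classical formula, namely the difference between two means divided by the root of the summed squared standard errors of the means; the article does not state the formula used. The group size of 25 for Experimental I is likewise an assumption (29 subjects less the four who reported the shock pleasant), since the sizes of the groups as tabled are not given.

```python
import math


def critical_ratio(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Difference between two means in units of its standard error (assumed formula)."""
    se_diff = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_a - mean_b) / se_diff


# Experimental I versus control on stimuli 11-20 (shockless for Experimental I).
# n = 25 is an assumption; the control group of 29 is stated in the text.
cr = critical_ratio(7.750, 1.300, 25, 6.483, 1.584, 29)
print(f"C.R. = {cr:+.2f}")
```

Under these assumptions the result falls close to the tabled value of +3.25, although exact agreement should not be expected while the formula and group sizes remain uncertain.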

There were many more wrong than doubtful responses 
for all groups, and shock increased the former rather than the 
latter. Thus the control group averaged 0.66 doubtful re- 
sponses when means of all stimuli are combined and averaged; 
whereas the experimental groups on the shock stimuli averaged 
0.65 doubtful responses. 

Another way of presenting the data is in terms of the 
number of subjects who gain or lose in consequence of the 
shock. Because of the positional factor, the two experimental 
groups are combined. Altogether, 31 of these subjects reg- 
ister more correct responses for the shockless than for the 
shocked stimuli, whereas only 8 give the opposite result. 
Moreover, the former S’s have a greater average than the 
latter, the respective values being 2.47 and 1.14. 

The data thus far are in harmony with the law of effect, 
although this does not mean that they give indisputable proof 
to the law. Were annoyingness the only consequent of the 
shock such an interpretation would be required. But the 
shock may be also a distraction. Indeed the verbal reports 
mention the latter much more often than the former. Be- 
sides, responses of the nine subjects who reported the shock 
pleasant show the same trend as those who reported it annoy- 
ing, whereas by the law of effect they should have given op- 
posite results. Thus their average of correct responses was 
7.50 in the shocked trials, compared to 7.66 in the shockless 
trials. This group is too small to be given any weight, never- 
theless it does render questionable as yet any definite ex- 
planation in terms of annoyingness. The need for a systema- 
tic attack upon this aspect of the problem is indicated. 
DISCUSSION


The above findings give rise to a curious scientific predica- 
ment, namely, that the same degree of electric shock retards 
learning in one situation and facilitates it in another. How is
this contradiction to be resolved? There are several possible
answers. 

1. It may be suggested that different kinds of learning are 
involved; that the acquisition of visual form is functionally 
dissimilar from the acquisition of positional habits such as are 
required in the maze; and that what therefore affects the one 
does not similarly affect the other. However, such an ex- 
planation is too far-fetched and logically indefensible to apply 
here, if indeed it can be applied anywhere. To postulate 
learning functions so dissimilar as to yield opposite results 
would seem to be absurd. 

2. The relation of the shock to the situation is different in 
the two cases. That this is true is clear upon analysis. In 
the maze or discrimination box shock comes to the subject 
as a related part of the situation. It belongs; and whatever 
he does in or to the situation affects his experience of that 
shock. On the other hand, in circumstances such as those of 
the present study, the shock comes as something not func- 
tionally related to the core or central Gestalt of the situation. 
It does not belong. Rather it pertains to the background, 
like other irrelevant items of the total field. Consequently 
to the extent to which it intrudes (and its affective value makes 
it compelling) it interferes with the dominance of the figure, 
acts as a competitor rather than an associate, and thus retards 
impression.

If this explanation is correct, it suggests that Thorndike’s 
law of belonging ought to be extended, and the law of effect 
should then be made corollary to it. 

3. Another possible explanation is that the distinction 
may be in the temporal relation of the shock and the act. 
It has been reported that shock at the moment of choice is
disruptive of learning in discriminative situations,⁶ while shock
after choice is facilitating.³,⁴ In the present case, there is no
choice as that term is ordinarily used; nevertheless the behavior
may be functionally similar, since the subject is in the
process of selecting distinguishing or characteristic features 
of the object, and the attentive act is interrupted before it 
has completed the adjustment. For light on this point, the 
experiment ought now to be done with the shock coming just 
after the stimulus is removed. 


SUMMARY 


Three groups of human subjects, one control and two 
experimental groups, were presented a series of twenty non- 
sense designs. One experimental group received mildly 
annoying electric shock with the first ten designs, the other 
experimental group with the last ten designs. The shock 
decreased the average number of correct recognitions. The 
fact does not require explanation in terms of the law of effect 
as there is some reason to believe the result was a function of 
distraction rather than annoyance. The findings are contrary 
to those in which the same degree of shock accompanied right 
responses in maze situations, and it is suggested that the 
difference is a result of a different relation of the shock to the 
situation. An extension of Thorndike’s law of belonging 
seems to offer the most reasonable explanation. 


(Manuscript received January 21, 1939) 
A NOTE ON A TEMPORAL GRADIENT
OF REINFORCEMENT¹


BY JUDSON SEISE BROWN 
Department of Psychology, Institute of Human Relations, Yale University 


Hull’s goal gradient hypothesis (3, p. 26) states that
“. . . the goal reaction gets conditioned the most strongly
to the stimuli preceding it, and the other reactions of the
behavior sequence get conditioned to their stimuli progressively
weaker as they are more remote (in time or space)²
from the goal reaction.”³ The applicability of this hypothesis
to spatially conditioned behavior sequences has been
validated, in part at least, by previous experimental findings
(1, 4, 10, 11). If the principle of a gradient of reinforcement
holds in time as well as space, one would predict that a re- 
sponse which has been elicited recurrently at a particular 
interval of time would tend to be elicited with maximal 
strength at that interval, and would be elicited in a progres- 
sively weaker manner at other points more remote in time 
from that interval. The present experiment was designed 
to test this prediction and thereby to secure evidence concerning
the applicability of the hypothesis of a gradient of reinforcement
to temporally conditioned behavior sequences.⁴ The
general procedure employed was as follows: rats were trained
by being shocked repeatedly at intervals of 12 seconds, the
magnitude of their jump-responses to shock being recorded;
the rats were then given test shocks at 12 seconds and at intervals
shorter and longer than 12 seconds in order to determine
whether or not the reinforcement given at the 12 second
interval had spread to other points in time.


1 The author wishes to express appreciation to Drs. O. H. Mowrer, N. E. Miller
and D. G. Marquis for valuable suggestions and criticisms.

2 Italics by the author.

3 Following the suggestion of Miller and Miles (6), Hull (5) has indicated that
since the ‘goal’ is the point of reinforcement, a better term for ‘goal gradient’ would
be ‘gradient of reinforcement.’

4 Since points in time are not, strictly speaking, stimuli to which responses may
become conditioned, it is probable that the responses in the above situation become
conditioned either to specific segments of relatively stable action patterns occurring
sequentially in time (Guthrie, 2, pp. 53-55) or to particular sections of hypothetical
stimulus traces (Pavlov, 9, p. 43).


Apparatus and Procedure.—The apparatus was a modified form of a ‘rat stabilimeter’
developed by Dr. O. H. Mowrer. The platform of a postage scale (4 lb.
capacity), critically damped by a plunger in an oil filled cylinder, was fitted with a 
grid of no. 20 stainless steel wire stretched across an air gap at intervals of 3gin. The 
grid was surmounted by a removable, bottomless box measuring 7 X 2} X 5} in. 
During training and testing, the rats were placed one at a time on the grid and the 
box fastened down around them. Under these conditions, the jumps made by the 
rats in response to shock caused the platform of the scale to move up and down. 
These vertical movements of the scale platform were magnified by mechanical and 
optical levers and recorded photographically by means of a slit camera. Thus the 
magnitude of a rat’s response to shock could be accurately measured. A Warren 
Telechron motor (1 r.p.s.) fitted with a commutator arm and contacts was connected 
in series with a mercury switch, the stainless steel grid, and the output of a modified 
Muenzinger-Walz (7) constant-current shocking device. By closing the mercury 
switch, momentary shocks could be administered at any desired interval in multiples 
of one second. An auxiliary switch was provided by which the strength of the shock 
could be shifted from 0.5 (ma.) to 0.4 ma. 

Thirty-two rats,⁵ equally divided as to sex, were trained in the following manner.
Each rat was placed on the grid and given 35 training shocks of 0.5 ma. at intervals 
of 12 seconds. The last five jump-responses made to these shocks were recorded. 
Following the training shocks, each rat was tested by being shocked at each of the 
following intervals: 3, 6, 9, 12, 15, 18, 21, and 24 seconds, though not necessarily in 
that order. For these tests a slightly weaker shock of 0.4 ma. was used. Serial errors 
were eliminated by systematically varying the order of presentation of the test inter- 
vals. Likewise, the possibility of disrupting the established temporal rhythm by the 
interpolation of irregular test intervals was minimized by giving five training shocks 
(12 second intervals) after each of the above tests. All jumps made in response to 
test shocks and interpolated training shocks were recorded. The above training and 
testing procedure may be illustrated by presenting one of the 16 different test orders 
employed: (1) Thirty-five training shocks of 0.5 ma. at 12 second intervals; (2) one 
test shock of 0.4 ma. 15 seconds later; (3) five training shocks at 12 second intervals; 
(4) test shock 12 seconds later, followed by groups of five training shocks alternated 
with test shocks at intervals of 9, 6, 3, 24, 21, and 18 seconds. 
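
The test order just described can be laid out as a timed schedule. The sketch below rests on several assumptions not stated in the procedure: the first training shock is placed at time zero, the first interpolated training shock follows each test shock by 12 seconds, and a block of five training shocks also follows the last test. The function and variable names are illustrative.

```python
TRAINING_INTERVAL_S = 12          # seconds between training shocks
TRAINING_MA, TEST_MA = 0.5, 0.4   # shock strengths in milliamperes


def build_schedule(test_order, initial_training=35, interpolated_training=5):
    """Return (time in seconds, kind, milliamperes) for every shock given to one rat."""
    t = 0  # assumed: first training shock at time zero
    events = [(t, "train", TRAINING_MA)]
    for _ in range(initial_training - 1):
        t += TRAINING_INTERVAL_S
        events.append((t, "train", TRAINING_MA))
    for interval in test_order:
        t += interval  # the test shock follows the preceding shock by this interval
        events.append((t, "test", TEST_MA))
        for _ in range(interpolated_training):
            t += TRAINING_INTERVAL_S
            events.append((t, "train", TRAINING_MA))
    return events


# The one test order given in the text.
schedule = build_schedule([15, 12, 9, 6, 3, 24, 21, 18])
tests = [(t, ma) for t, kind, ma in schedule if kind == "test"]
print(len(schedule), "shocks in all;", len(tests), "test shocks")
```

Passing a different ordering of the eight intervals would give the schedule for any of the other test orders, which the article does not list.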


RESULTS 


The results of the experiment are presented graphically in
Fig. 1. The absolute magnitudes of the responses made at
each test interval were not used directly in determining the
ordinate values because of wide individual differences among
the rats in weight, strength, and sensitivity to shock. Hence
the procedure given under Fig. 1 was adopted. Although
median values have been plotted in Fig. 1, the curve obtained
by plotting the means was identical with it save for slight
displacements of points 6 and 9 due to extreme scores.


5 The record of one rat was omitted from the data presented below because its
jumps were so great as to drive the recording light beam off the camera slit.
[Figure 1. Abscissa: no. of seconds by which test shocks followed last training shock.]

Fig. 1. Showing the relation between magnitude of response and the temporal
interval at which the response is evoked. The values on the abscissa are the eight
different temporal intervals at which test shocks were given. The values on the
ordinate were determined by first expressing the strength of each rat’s test jumps as
percentage values of the mean strength of all his training jumps. The median scores
of the sets of values thus obtained for the different test intervals are given here.
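
The scoring procedure described under Fig. 1 may be sketched as follows. The jump magnitudes are hypothetical figures chosen only to illustrate the arithmetic, and the function names are not taken from the article; only two test intervals are shown for brevity.

```python
from statistics import mean, median


def percent_of_training(test_jumps, training_jumps):
    """Express each test jump as a percentage of the rat's mean training jump."""
    base = mean(training_jumps)
    return {interval: 100.0 * jump / base for interval, jump in test_jumps.items()}


def median_curve(rats):
    """Median percentage score across rats at each test interval."""
    per_rat = [percent_of_training(r["test"], r["training"]) for r in rats]
    intervals = sorted(per_rat[0])
    return {i: median(p[i] for p in per_rat) for i in intervals}


# Hypothetical records for three rats (arbitrary recording units).
rats = [
    {"training": [10.0, 12.0, 8.0], "test": {3: 5.0, 12: 10.0}},
    {"training": [20.0, 20.0], "test": {3: 8.0, 12: 22.0}},
    {"training": [4.0, 6.0], "test": {3: 3.0, 12: 6.0}},
]
curve = median_curve(rats)
print(curve)
```

Scoring each rat against its own training mean removes the differences in weight, strength, and sensitivity before the rats are combined, and the median limits the influence of extreme scores.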


From this figure it is evident that the strength of the 
tendency to respond to shock by jumping increases progres- 
sively up to the 12 second point (the training interval), drops 
rather sharply at 15 seconds, increases again to the 21 second 
point and drops again at 24 seconds. The initial rise from 3 
to 12 and the drop from 12 to 15 are both highly reliable, the 
difference between the mean values of the former being 3.25 
times the standard error of the difference, that between the 
mean values of the latter being 3.49 times its standard error. 

The increase in response magnitude from 3 to 12 seconds
clearly substantiates our original assumption that a gradient
of reinforcement may apply to temporally as well as spatially 
conditioned behavior sequences. Although the gradient is 
not symmetrical on both sides of the 12 second point, the gen- 
eral tendency for the strength of response to diminish with 
distance (in time) from the point of reinforcement, both 
forward and backward, is suggested by the data. In order
to be conclusive, however, a demonstration of the bi-directionality
of the temporal spread of reinforcement would require
the introduction of test points between 12 and 15.

It is interesting to note that the strength of the response 
tendency at 15 seconds, i.e., three seconds after the time when
a regular training shock might have been received, is weaker than
that at every other point save for point 3, which followed an 
actual training shock by 3 seconds. The suddenness of this 
decrease in strength of response immediately following points 
of either actual or ‘expected’ reinforcement, is not deducible 
from the simple principle of a gradient of reinforcement, since 
such a principle would indicate the existence of an approxim- 
ately bi-symmetrical spread of reinforcement. A number of 
alternative hypotheses for dealing with this phenomenon 
might be suggested, the most probable one being that in a 
temporal series of this sort the temporal interval plus the shock 
become conditioned stimuli for a response of relaxing, which 
is of course incompatible with jumping. In the absence of the 
shock, the temporal interval by itself might well evoke such 
a relaxation response. 

The rise from 15 to 21 appears to constitute a second 
gradient of reinforcement conditioned to an interval double 
that used in training. Here, however, the peak occurs prior 
to the 24 second point, suggesting that probably the rat’s 
ability to discriminate is less accurate at the longer intervals 
and hence the point of reinforcement tends to be anticipated. 
The reliability of the difference between the means of points 
15 and 21 is indicated by a critical ratio of 2.91. The drop 
from 21 to 24 is not highly reliable, the critical ratio being 1.55. 

The results of the present experiment show an interesting 
parallel to the work of other investigators. Muenzinger, 
Dove, and Bernstone (8) have advanced evidence for the
existence of a bi-directional gradient in space which is some- 
what similar to that we have found to exist in time. Since 
space and time are obviously quite inseparable in the maze 
situation, it seems probable that their results might be due, in 
part at least, to a temporal spread of reinforcement under- 
lying the spatial factors involved. 

Thorndike (12) has shown that reinforcement tends to 
spread in a bi-directional fashion from the rewarded item of a 
serial response sequence to other non-rewarded items, evidence 
which further supports the notion of a temporal gradient of 
reinforcement. In his experiments the non-reinforced points 
cannot be described in terms of their spatial distance from the 
reinforced points. They can, however, be ordered with 
reference to their temporal distance from such points. Pavlov 
(9, p. 41) has demonstrated that if a reflex has been conditioned 
to a sound plus a specific temporal interval, presentation 
of the sound at intervals less than that used in training 
will evoke the conditioned reflex with increasing magnitude as 
the customary training interval is approached. Furthermore, 
his results support our observation that the magnitude of 
response drops off with extreme rapidity at points in time 
immediately following a reinforced training interval. Using 
a 30 minute interval, he found that the presentation of the 
conditioned stimulus five minutes after a regular feeding 
failed to produce any alimentary conditioned reflex. But if 
the conditioned stimulus were presented five minutes before 
a regular feeding, a strong conditioned reflex would be evoked. 

It should be noted, in concluding, that in the present 
experiment no external conditioned stimulus was applied. 
Hence an interpretation of the phenomenon in terms of strict 
stimulus-response principles would necessitate the postulation 
of some sort of internal states, stimulus traces or action 
sequences to which the response might become conditioned. 
Since the unconditioned stimulus (shock) was always used 
to evoke the response, it seems probable that the differences 
in magnitude of response at different test intervals were due 
to the superposition of a conditioned response upon the unconditioned 
response. By virtue of the initial training, this 
superimposed conditioned response, being reinforced only at 
the training interval, reached its maximal strength at that 
interval. From this point, the response tendency spread to 
other non-reinforced intervals. 


SUMMARY 


1. Thirty-two rats, equally divided as to sex, were placed 
one at a time into a stabilimeter-like device capable of register- 
ing the strength of a rat’s jump-response to shock. A 12 
second rhythm was set up in these animals by giving them 
35 shocks of 0.5 ma. at intervals of 12 seconds. The rats 
were then tested at the 12 second interval and at intervals 
shorter and longer than 12 seconds, a weaker shock of 0.4 
ma. serving as the test stimulus. 

2. The results of these tests indicate that the learned 
tendency to respond at the 12 second interval, over and above 
the unconditioned tendency to respond to shock at any time, 
tends to generalize or spread to other temporal intervals. 

3. The strength of this generalized response tendency 
increases as the temporal distance from the next point of 
reinforcement in the series decreases. This increase in re- 
sponse magnitude tends to substantiate the hypothesis that a 
gradient of reinforcement may exist in time as well as space. 

4. The strength of the generalized tendency drops sharply 
after a customary point of reinforcement has been passed, 
regardless of whether an actual reinforcing stimulus has been 
presented at that point. 

5. Evidence is presented which indicates the existence of 
a second temporal gradient of reinforcement appearing at an 
interval approximately double that used in training. 

6. The results are discussed and the suggestion made that 
the concept of a temporal gradient of reinforcement may be 
useful in clarifying the results of certain spatio-temporal 
experiments where spatial factors alone have previously borne 
the burden of explanation. 


(Manuscript received January 17, 1939) 


RETROACTIVE INHIBITION: THE TEMPORAL 
POSITION OF INTERPOLATED ACTIVITY 


BY E. DONALD SISSON 
Louisiana State University 


In previous articles ¹ on the subject of retroactive inhibition, 
the writer suggested that the effect may be largely 
accounted for by ‘confusion at the time of recall between the 
memorial organizations of two learned tasks.’ This is, of 
course, a form of transfer theory. Any factor favoring 
inter-list confusion would enhance the retroactive effect. Any 
factor, on the other hand, helping to isolate the two tasks 
would reduce this effect. One such factor is the temporal 
relation of the two activities. 

The previous work ² on the temporal position of the interpolated 
activity, relative to the original learning, is inconclusive. 
Müller and Pilzecker ³ claimed that the closer together 
the two activities in time, the greater is the retroactive 
effect. This was, of course, evidence in support of their 
perseveration theory.⁴ Robinson ⁵ and McGeoch and Nolen ⁶ 
found no differences with different temporal relations between 
original and interpolated materials, while Skaggs ⁷ obtained 
results in line with those of Müller and Pilzecker. 

1 Sisson, E. D., Retroactive inhibition: the influence of degree of associative value 
of original and interpolated lists. J. Exper. Psychol., 1938, 22, 573-580; Retroactive 
inhibition: serial versus random order of presentation of material. J. Exper. Psychol., 
1938, 23, 288-294. 

2 For summaries of the literature on retroactive inhibition the reader is referred to 
Britt, S. H., Retroactive inhibition: a review of the literature. Psychol. Bull., 1935, 
32, 381-440; McGeoch, J. A., Learning and retention of verbal material. Psychol. 
Bull., 1934, 31, 381-407. 

3 Müller, G. E., and Pilzecker, A., Experimentelle Beiträge zur Lehre vom Gedächtniss. 
Zsch. f. Psychol. u. Physiol. d. Sinnes., 1900, Ergänzungsband, 1. 

4 For a discussion of theories of retroactive inhibition see Britt, S. H., Theories of 
retroactive inhibition. Psychol. Rev., 1936, 43, 207-216. 

5 Robinson, E. S., Some factors determining the degree of retroaction. Psychol. 
Monog., 1920, 28, No. 128. 

6 McGeoch, J. A., and Nolen, M. E., Studies in retroactive inhibition: IV. Temporal 
point of interpolation and degree of retroactive inhibition. J. Comp. Psychol., 
1933, 15, 407-417. 

There are logically three positions which the interpolated 
task may take in relation to the original task: (1) the inter- 
polated task may be introduced immediately after learning of 
the original activity; (2) immediately before relearning of 
the original activity; or (3) it may occupy a position some- 
where between these two points. According to our hypo- 
thesis, both positions (1) and (2) would offer greater oppor- 
tunity for inter-list confusion than should a position half way 
between original learning and relearning. McGeoch and 
Nolen,⁸ using only positions (1) and (2), i.e., immediately 
after original learning or immediately before relearning, found 
no differences in the decrements produced by these two positions. 
Both Robinson ⁹ and Skaggs ¹⁰ employed the intermediate 
positions but their total interval between learning 
and relearning was probably so brief ¹¹ as to minimize any 
differential effects. 

The experiment to be reported used a total interval of 96 
hours with three different positions for the interpolated 
activity: (1) immediately after original learning, (2) 48 hours 
after original learning, and (3) immediately before relearning. 

Procedure. The experiment involved the learning and relearning of two ten-word 
lists of two-syllable adjectives, the words in the second or interpolated list being 
synonyms of the words in the first or original list and in the same serial order.¹² The 
words were printed in letters one inch tall and were presented to groups of subjects by 
means of a large motor-driven memory drum at the rate of three seconds per word. 
The words were large enough to be easily read by subjects in all parts of the room. 
Five presentations of the lists were given, and learning was tested by the method of 
retained members. The subjects attended to the words for five complete presentations 
of the list and were then instructed to write down as many of the words as they could, 
regardless of serial position. A period of 90 seconds was allowed for this. Recall 
after the interval was tested in the same way with the exception that the initial word 
of the list was given as a cue.¹³ 

7 Skaggs, E. B., Further studies in retroactive inhibition. Psychol. Monog., 1925, 
34, No. 161. 

8 McGeoch, J. A., and Nolen, M. E., op. cit. 

9 Robinson, E. S., op. cit. 

10 Skaggs, E. B., op. cit. 

11 Robinson used an interval of 20 minutes and Skaggs one of only ten minutes. 

12 Skaggs (A discussion of the temporal point of interpolation and degree of retroactive 
inhibition. J. Comp. Psychol., 1933, 16, 411-414) has criticized the use of 
interpolated material similar to original material in studies of temporal aspects of 
interpolated activity. His reasoning seems to the present author to beg the question 
in favor of the perseveration theory. Synonyms were used because they yield large 
amounts of retroaction in experiments of this sort. 

Three conditions were studied,¹⁴ a different group of subjects being used for each 
condition. In condition A a second list was presented in the same way immediately 
after learning of the first list. In condition B, the second list was introduced 48 hours 
after the first list and in condition C, immediately before the delayed recall of the first 
list. The numbers of subjects in the three groups were 48, 71, and 50 respectively, all 
being members of the introductory Psychology class. During the intervals, the sub- 
jects followed the regular academic routine. Precautions were, of course, taken to 
forestall rehearsal of the lists. 


RESULTS 

The mean numbers of words written by the subjects during 
immediate recall under the various conditions studied are 
presented in Table I. These values are in fairly close agreement. 
None of the values for original learning are reliably 
different, and the same is true for the values for the interpolated 
lists.¹⁵ The values for interpolated learning show 
what appears to be a positive transfer or practice effect. The 
amount of this effect varies directly with the temporal proximity 
of the two learning activities, being completely absent 
in condition C. 


TABLE I 
MEAN NUMBER OF WORDS WRITTEN DURING IMMEDIATE RECALL 

Condition    N      Original          Interpolated 
                    Mean     σm       Mean     σm 
A            48     9.208    .12      9.667    .09 
B            71     9.535    .10      9.662    .08 
C            50     9.520    .12      9.520    .11 
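
The critical ratios cited for the immediate-recall means can be checked from the tabled values. The following is a minimal sketch, not the author's own computation: it assumes the standard-error columns of Table I read .12, .10, .12 (original lists) and .09, .08, .11 (interpolated lists), and it assumes that the standard error of a difference between two independent groups is the square root of the sum of the squared standard errors of the means. The function and variable names are illustrative only.

```python
import math

# Means and standard errors of the mean as read from Table I.
# The decimal placement of the standard errors is an assumed reading.
TABLE_I = {
    "A": {"original": (9.208, 0.12), "interpolated": (9.667, 0.09)},
    "B": {"original": (9.535, 0.10), "interpolated": (9.662, 0.08)},
    "C": {"original": (9.520, 0.12), "interpolated": (9.520, 0.11)},
}


def critical_ratio(mean_1, se_1, mean_2, se_2):
    """Return |D| / sigma_d for two independent groups.

    Assumes sigma_d = sqrt(se_1**2 + se_2**2), the usual standard
    error of a difference between uncorrelated means.
    """
    difference = abs(mean_1 - mean_2)
    sigma_d = math.sqrt(se_1 ** 2 + se_2 ** 2)
    return difference / sigma_d


def pairwise_ratios(list_name):
    """Critical ratios for the three pairs of conditions on one list."""
    pairs = [("A", "B"), ("A", "C"), ("B", "C")]
    return {
        f"{a}-{b}": critical_ratio(*TABLE_I[a][list_name], *TABLE_I[b][list_name])
        for a, b in pairs
    }


if __name__ == "__main__":
    for list_name in ("original", "interpolated"):
        for pair, ratio in pairwise_ratios(list_name).items():
            print(f"{list_name:12s} {pair}: {ratio:.2f}")
```

Under these assumptions the computed ratios fall within a few hundredths of those reported in the footnote, and none reaches 3, which is consistent with the statement that the immediate-recall values are not reliably different.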

The mean numbers of words recalled after the interval are 
presented in Table II. We will consider the values 
for the interpolated lists first. The mean number of words 
recalled increases from condition A to C, which is, of course, 
to be expected on the basis of decreasing intervals since learning. 
An interesting point is revealed by a comparison of 
these interpolated values with the original values. In the 
B condition the value for the interpolated list is lower than 
that for the original list, whereas the interval since learning is 
twice as great for the latter. This difference is large and 
significant according to the statistical criterion (critical 
ratio, 4.71). No attempt at an explanation of this difference 
is made here, except to suggest that it may be due to 
interference with the recall of the second list following recall 
of the first. This is, of course, only a conjecture. If true, 
the difference should not be so marked where the two tasks 
are more dissimilar. 


TABLE II 
MEAN NUMBER OF WORDS WRITTEN DURING DELAYED RECALL 

Condition    N      Original          Interpolated 
                    Mean     σm       Mean     σm 
A            48     3.145    .21      3.312    .23 
B            71     6.282    .28      4.493    .25 
C            50     5.120    .29      8.500    .21 


13 This method was used because it seemed better than merely asking for ‘the first 
list’ and ‘the second list’ etc. The recall scores are, of course, somewhat higher than 
they might have been otherwise. 

14 Since we were not attempting to prove the existence of retroactive inhibition, 
but merely to compare the amounts occurring under the different conditions employed, 
no rest condition was included. 

15 The critical ratios (D/σd) are: original lists, A-B, 2.10; A-C, 1.83; B-C, .09; 
interpolated lists, A-B, .04; A-C, 1.05; B-C, 1.01. 

Turning now to the values for the recall of the original 
lists, we note important differences. As shown in Table III, 
all of these differences satisfy the requirements of statistical 
reliability.¹⁶ The B condition is more favorable to recall than 
either of the other conditions. This is of course in line with 
the hypothesis presented at the beginning of the paper. 
Furthermore, the C condition yields greater returns than 
condition A. This is not in line with the results of McGeoch 
and Nolen,¹⁷ who found no differences when the second list 
was presented immediately after learning or immediately 
before relearning of the first list. 


TABLE III 
CRITICAL RATIOS (D/σd) OF THE DIFFERENCES BETWEEN THE VARIOUS CONDITIONS 
IN THE RECALL OF THE ORIGINAL LISTS 

Differences    Critical Ratios 
A-B            8.96 
A-C            5.51 


DISCUSSION 


The purpose of this experiment, as stated at the beginning 
of the paper, was to further test the hypothesis that retro- 
action is a result of confusion, at the time of recall, between 
the memory traces or substrates of two learned activities. 
It was assumed that where both tasks were learned at the 
same time, or where learning of one and recall of the other 
occurred at the same time there should be more confusion than 
where experience with the two activities was separated by intervals 
of time. In other words, temporal similarity is to be 
considered as well as similarity in content, meaning, operation, 
etc. According to this hypothesis, condition B in this experi- 
ment should have yielded a greater percentage of correct 
recall than should conditions A or C. This was found to be 
true, and to this extent, it adds further weight to the transfer 
theory of retroaction as represented here. 

As a serial list is learned, associations are formed between 
members of the list, especially adjacent members. If two 
lists are learned, associations are also formed between them, 
especially if the two tasks are similar in some way: in content, 
meaning, operation, and time of experience. The more and 
stronger these cross bonds, the greater the confusion at recall 
and thus the greater the amount of retroaction. In condition 
B, circumstances were such that fewer of these cross associations 
were formed; consequently, the serial associations 
predominate, yielding more nearly correct recall. In condition A, 
since both lists were experienced at the same time, strong cross 
bonds would form, leading to inter-list confusion. Moreover, 
both lists offer about the same degree of integration, having 
been learned to the same degree and having suffered the same 
interval of disuse. In this respect condition C differs. Here 
the second list is fresh and well integrated, while the traces of 
the first list are presumably weak. With one list well 
organized as a unit, there is less chance of inter-list transfer 
than if both are unorganized. This would explain why retroaction 
varies inversely as the number of presentations given 
the original or interpolated material.¹⁸ And it would account 
for the larger number of correct recalls under the C condition 
than under A. 


16 Since small yet unreliable differences exist between the values for original learning, 
we have also computed relative recall scores according to the formula: 
100 × (delayed recall scores / immediate recall scores). 
These values, for the three conditions, A, B and C, are 35.25, 65.88 and 53.78, 
respectively. 

17 McGeoch, J. A., and Nolen, M. E., op. cit. 
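
The relative recall scores mentioned in the footnote to the Results (100 times the delayed recall score divided by the immediate recall score) can be illustrated from the group means for the original lists. This is a sketch only: it assumes the score is formed from the tabled group means rather than from individual subjects' scores, and the names used are illustrative. On that assumption the values for conditions B and C reproduce the reported 65.88 and 53.78, while the value for condition A comes out near 34.2 rather than the reported 35.25; the order of the three conditions (B highest, then C, then A) is the same in either case.

```python
def relative_recall(delayed_mean, immediate_mean):
    """Return 100 x delayed / immediate, the relative recall score."""
    if immediate_mean <= 0:
        raise ValueError("immediate recall mean must be positive")
    return 100.0 * delayed_mean / immediate_mean


# Original-list group means: (immediate recall, delayed recall),
# taken from Tables I and II. Using group means is an assumption.
ORIGINAL_LIST_MEANS = {
    "A": (9.208, 3.145),
    "B": (9.535, 6.282),
    "C": (9.520, 5.120),
}


def relative_recall_by_condition():
    """Relative recall score of the original list for each condition."""
    return {
        condition: relative_recall(delayed, immediate)
        for condition, (immediate, delayed) in ORIGINAL_LIST_MEANS.items()
    }


if __name__ == "__main__":
    for condition, score in relative_recall_by_condition().items():
        print(f"Condition {condition}: {score:.2f}")
```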


SUMMARY 


Three groups of subjects learned a list of ten two-syllable 
adjectives, followed by a second list, composed of synonyms 
of these words. The variable studied was the temporal 
position of the interpolated activity. Three positions were 
studied: immediately after original learning, halfway between 
original learning and recall, and just prior to recall of the 
original list. The interval from learning to recall was 96 
hours in length. 

(1) The position halfway between learning and recall of 
the original list resulted in the least amount of retroaction. 

(2) The introduction of a second list just prior to recall 
yielded less retroaction than when given just after learning 
of the first list. 

(3) These results may be explained in terms of confusion 
between constellations of associations and memory traces. 


(Manuscript received January 12, 1939) 


18 McGeoch, J. A., The influence of degree of learning upon retroactive inhibition. 
Amer. J. Psychol., 1929, 41, 252-262; The influence of degree of interpolated learning 
upon retroactive inhibition. Amer. J. Psychol., 1932, 44, 695-708. 